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ARTIFICIAL ANTIBODY POLYPEPTIDES 

5 Portions of the present invention were made with support of the United 

States Government via a grant from the National lnstitutes of Health under grant 
number GM 55042 The U.S. Government therefore may have certain rights in 
the invention. 

10 FIELD OF THE INVENTION 

The present invention relates generally to the field of the production and 
selection of binding and catalytic polypeptides by the methods of molecular 
biology. The invention specifically relates to the generation of both nucleic acid 
and polypeptide libraries encoding the molecular scaffolding of a modified 
15 Fibronectin Type III (Fn3) molecule. The invention also relates to "artificial 
mini-antibodies" or "monobodies " z.e., polypeptides containing an Fn3 scaffold 
onto which loop regions capable of binding to a variety of different molecular 
structures (such as antibody binding sites) have been grafted. 

20 BACKGROUND OF THE INVENTION 

Antibody structure 

A standard antibody (Ab) is a tetrameric structure consisting of two 
identical immunoglobulin (Ig) heavy chains and two identical light chains. The 
heavy and light chains of an Ab consist of different domains. Each light chain 

25 has one variable domain (VL) and one constant domain (CL), while each heavy 
chain has one variable domain (VH) and three or four constant domains (CH) 
(Alzari et aL, 1988). Each domain, consisting of - 1 10 amino acid residues, is 
folded into a characteristic p-sandwich structure formed from two p-sheets 
packed against each other, the inomunoglobulin fold. The VH and VL domains 

30 each have three complementarity determining regions (CDRl-3) that are loops, 
or turns, connecting p-strands at one end of the domains (Fig. 1 : A, C). The , . 
variable regions of both the light and heavy chains generally contribute to 
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antigen specificity, although the contribution of the individual chains to 
specificity is not always equal. Antibody molecules have evolved to bind to a 
large number of molecules by using six randomized loops (CDRs). However, 
the size of the antibodies and the complexity of six loops represents a major 
5 design hurdle if the end result is to be a relatively small peptide ligand. 

Antihodv substnictures 

Functional substructures of Abs can be prepared by proteolysis and by 
recombinant methods. They include the Fab firagment, which contains the VH- 
10 CHI domains of the heavy chain and the VL-CLl domains of the Ught chain 
joined by a single interchain disulfide bond, and the Fv fi-agment, which contains 
only the VH and VL domains. In some cases, a single VH domain retains 
significant affinity (Ward et al, 1989). It has also been shown that a certam 
monomeric k light chain will specifically bind to its cognate antigen. (L. Masat 
15 etal.,\ 994). Separated Hght or heavy chains have sometimes beai found to 
retain some antigen-binding activity (Ward et al, 1989). These antibody 
fragments are not suitable for stiiichiral analysis using NMR spectroscopy due to 
their size, low solubility or low conformational stability. 

Another functional substructure is a single chain Fv (scFv), made of the 
20 variable regions of the immunoglobulin heavy and light chain, covalently 
connected by a peptide linker (S-z Hu et al., 1996). These small (M, 25,000) 
proteins generally retain specificity and affinity for antigen in a single 
polypeptide and can provide a convenient building block for larger, antigen- 
specific molecules. Several groups have reported biodistribution studies in 
25 xenografted athymic mice using scFv reactive against a variety of tumor 

antigens, in which specific tumor locahzation has been observed. However, the 
short persistence of scFvs in the ckculation limits the exposure of tumor cells to 
the scFvs, placing limits on the level of uptake. As a result, tiimor uptake by 
scFvs in animal studies has generally been only l-5%ID/g as opposed to intact 
30 antibodies that can localize in tiamors ad 30-40 %ID/g and have reached levels as 
high as 60-70 %ID/g. 

A small protein scaffold called a "minibody" was designed using a part 
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of the Ig VH domain as the template (Pessi et al, 1993). Minibodies with high 
affinity (dissociation constant (K^) - 10'^ M) to inter! euldn-6 were identified by 
randomizing loops corresponding to CDRl and CDR2 of VH and then selecting 
mutants using the phage display method (Martin et ai, 1994). These 
5 experiments demonstrated that the essence of the Ab fianction could be 

transferred to a smaller system. However, the minibody had inherited the limited 
solubility of the VH domain (Bianchi et al, 1994). 

It has been reported that camels (Camelus dromedarius) often lack 
variable light chain domains when IgG-like material firom their serum is 

1 0 analyzed, suggesting that sufficient antibody specificity and affinity can be , 
derived form VH domains (three CDR loops) alone. Davies and Riechmann 
recently demonstrated that "camelized" VH domains with high affinity (K^j - 10* 
' M) and high specificity can be generated by randomizing only the CDR3. To 
improve the solubility and suppress nonspecific binding, three mutations were 

1 5 introduced to the framework region (Davies & Riechmann, 1 995). It has not 
been definitively shown, however, that camelization can be used, in general, to 
improve the solubility and stability of VHs. 

An alternative to the "minibody" is the "diabody." Diabodies are small 
bivalent and bispecific antibody fragments, le., they have two antigen-binding 

20 sites. The fragments contain a heavy-chain variable domain (V„) connected to a 
light-chain variable domain (VJ on the same polypeptide chain (V^-Vl). 
Diabodies are similar in size to an Fab fragment. By using a linker that is too 
short to allow pairing between the two domains on the same chain, the domains 
are forced to pair with the complementary domains of another chain and create 

25 two antigen-binding sites. These dimeric antibody fragments, or "diabodies," are 
bivalent and bispecific (P. Holliger et al, 1 993). 

Since the development of the monoclonal antibody technology, a large 
number of 3D structures of Ab fragments in the complexed and/or free states 
have been solved by X-ray crystallography (Webster a/., 1994; Wilson & 

30 Stanfield, 1 994). Analysis of Ab structures has revealed that five out of the six 
CDRs have limited numbers of peptide backbone conformations, thereby 
permitting one to predict the backbone conformation of CDRs using the so- 
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called canonical structures (Le^k & Tramontane, 1992; Rees et al, 1994). The 
analysis also has revealed that the CDR3 of the VH domain (VH-CDR3) usually 
has the largest contact surface and that its conformation is too diverse for 
canonical structures to be defined; VH-CDR3 is also known to have a large 
5 variation in length (Wu et al, 1993). Therefore, the structures of crucial regions 
of the Ab-antigen interface still need to be experimentally determined. 

Comparison of crystal structures between the free and complexed states 
has revealed several types of conformational rearrangements. They include side- 
chain rearrangements, segmental movements, large rearrangements of VH-CDR3 
10 and changes in the relative position of the VH and VL domains (Wilson & 

Stanfield, 1993). hi the free state, CDRs, in particular those which undergo large 
confomiational changes upon binding, are expected to be flexible. Since X-ray 
crystallography is not suited for characterizing flexible parts of molecules, 
stmctural studies in the solution state have not been possible to provide dynamic 
1 5 pictures of the conformation of antigen-binding sites. 

Mimipkiny r the an ^H^Hy-hinding site 

CDR peptides and organic CDR mimetics have been made (Dougall et 
al, 1994). CDR peptides are short, typically cycUc, peptides which correspond 
20 to the ammo acid sequences of CDR loops of antibodies. CDR loops are 
responsible for antibody-antigen interactions. Organic CDR mimetics are 
peptides corresponding to CDR loops which are attached to a scaffold, e.g., a 

small organic compound. 

CDR peptides and organic CDR mimetics have been shown to retain 
25 some binding affinity (Smyth & von Itzstem, 1994). However, as expected, they 
are too small and too flexible to maintain full affinity and specificity. Mouse 
CDRs have been grafted onto the human Ig framework without the loss of 
affinity (Jones effl/., 1986; RiechmanneM/., 1988), though this "humanization" 
does not solve the above-mentioned problems specific to solution studies. 

30 

Mimipking n atural selec tion processes of Ab s 

In the immune system, specific Abs are selected and ampUfied from a 
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large library (affinity maturation). The processes can be reproduced in vitro 
using combinatorial library technologies. The successful display of Ab 
fragments on the surface of bacteriophage has made it possible to generate and 
screen a vast number of CDR mutations (McCafferty et al, 1990; Barbas et al, 
5 1 99 1 ; Winter et al , 1 994). An increasing number of Fabs and Fvs (and their 
derivatives) is produced by this technique, providing a rich source for structural 
studies. The combinatorial technique can be combined with Ab mimics. 

A number of protein domains that could potentially serve as protein 
scaffolds have been expressed as fusions with phage capsid proteins. Review in 

10 Clackson & Wells, Trends Biotechnol. 12:173-1 84 (1994). Indeed, several of 
these protein domains have already been used as scaffolds for displaying random 
peptide sequences, including bovine pancreatic trypsin inhibitor (Roberts et al^ 
PNAS 89:2429-2433 (1992)), human growth hormone (Lowman et al. 
Biochemistry 30:10832-10838 (1991)), Venturini et al. Protein Peptide Letters 

1 5 1 :70-75 ( 1 994)), and the IgG binding domain of Streptococcus (O'Neil et al , 
Techniques in Protein Chemistry V (Crabb, L,. ed.) pp. 517-524, Academic 
Press, San Diego (1994)). These scaffolds have displayed a single randomized 
loop or region. 

Researchers have used the small 74 amino acid a-amylase inhibitor 
20 Tendamistat as a presentation scaffold on the filamentous phage M 1 3 
(McConnell and Hoess, 1995). Tendamistat is a P-sheet protein fi-om 
Streptomyces tendae. It has a number of features that make it an attractive 
scaffold for peptides, including its small size, stability, and the availability of 
high resolution NMR and X-ray structural data. Tendamistat's overall topology 
25 is similar to that of an immunoglobulin domain, with two P-sheets connected by 
a series of loops. In contrast to immunoglobulin domains, the P-sheets of 
Tendamistat are held together with two rather than one disulfide bond, 
accounting for the considerable stability of the protein. By analogy with the 
CDR loops found in immunoglobulins, the loops the Tendamistat may serve a 
30 similar function and can be easily randomized by in vitro mutagenesis. 

Tendamistat, however, is derived from Streptomyces tendae. Thus, 
while Tendamistat may be antigenic in humans, its small size may reduce or 
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inhibit its antigenicity. Also, Tendamistat's stability- is uncertain. Fnrther.the 
stability that is reported for Tendamistat is attributed to the presence of two 
disulfide bonds. Disulfide bonds, however, are a significant disadvantage to 
such molecules in that they can be broken under reducing conditions and must be 

5 properly formed in order to have a useful protein structure. Further, the size of 
the loops in Tendamistat are relatively small, thus limiting the size of the inserts 
that can be accommodated in the scaffold. Moreover, it is well known that 
forming correct disulfide bonds in newly synthesized peptides is not 
straightforward. When a protein is expressed in the cytoplasmic space ofE. coli, 

1 0 the most common host bacterium for protein overexpression, disulfide bonds are 
usuaUy not formed, potentially making it difficult to prepare large quantities of 
engineered molecules. 

Thus, there is an on-going need for small, single-chain artificial 
antibodies for a variety of therapeutic, diagnostic and catalytic appUcations. hi 

1 5 particular, there is an on-going need for artificial antibodies that are structurally 
stable at neutral pH. 

SUMMARY OF THE INVENTION 

The present invention provides a fibronectin type III (Fn3) molecule, 
20 wherein the Fn3 contains a stabilizing mutation. A stabilizing mutation is 

defined herein as a modification or change in the amino acid sequence of the Fn3 
molecule, such as a substitution of one amino acid for another, that increases the 
melting point of the molecule by more than 0. 1 °C as compared to a molecule 
that is identical except for the change. Alternatively, the change may increase 
25 the melting point by more than 0.5°C or even 1 .0°C or more. A method for 
determining the melting point of Fn3 molecules is given in Example 19 below. 

The Fn3 may have at least one aspartic acid (Asp) residue and/or at least 
one glutamic acid (Glu) residue that has been deleted or substituted with at least 
one other amino add residue. For example. Asp 7 and/or Asp 23 and/or Glu 9, 
30 may have been deleted or substituted with at least one other amino acid residue. 
Asp 7, Asp 23, or Glu 9, may have been substituted with an asparagine (Asn) or 
lysine (Lys) residue. The present invention further provides an isolated nucleic 
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acid molecule and an expression vector encoding an Fn3 molecule wherein the 
Fn3 contains a stabilizing mutation. 

The invention provides a fibronectin type III (Fn3) polypeptide 
monobody containing a plurality of Fn3 P-strand domain sequences that are 
5 Hnked to a plurality of loop region sequences wherein the Fn3 contains a 

stabilizing mutation. One or more of the monobody loop region sequences of the 
Fn3 polypeptide vary by deletion, insertion or replacement of at least two amino 
acids from the corresponding loop region sequences in wild-type Fn3. The p- 
strand domains of the monobody have at least about 50% total amino acid 
10 sequence homology to the corresponding amino acid sequence of wild-type 

Fn3's p-strand domain sequences. Preferably, one or more of the loop regions of 
the monobody contain amino acid residues: 

i) from 15 to 1 6 inclusive in an AB loop; 

ii) from 22 to 30 inclusive in a BC loop; 
15 iii) from 39 to 45 inclusive in a CD loop; 

iv) from 5 1 to 55 inclusive in a DE loop; 

v) from 60 to 66 inclusive in an EF loop; and 

vi) from 76 to 87 inclusive in an FG loop. 

The invention also provides a nucleic acid molecule encoding a Fn3 
20 polypeptide monobody wherein the Fn3 contains a stabilizing mutation, as well 
as an expression vector containing the nucleic acid molecule and a host cell 
containing the vector. 

The invention further provides a method of preparing a Fn3 polypeptide 
monobody wherein the Fn3 contains a stabilizing mutation. The method 
25 includes providing a DNA sequence encoding a plurality of Fn3 p-strand domain 
sequences that are linked to a plurality of loop region sequences, wherein at least 
one loop region of the sequence contains a unique restriction enzyme site. The 
DNA sequence is cleaved at the unique restriction site. Then a preselected DNA 
segment is inserted into the restriction site. The preselected DNA segment 
30 encodes a peptide capable of binding to a specific binding partner (SEP) or a 
transition state analog compound (TSAC). The insertion of the preselected DNA 
segment into the DNA sequence yields a DNA molecule which encodes a 
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polypeptide monobody having an insertion. The DNA molecule is then 
expressed so as to yield the polypeptide monobody. 

Also provided is a method of preparing a Fn3 polypeptide monobody 
wherein the Fn3 contains a stabilizing mutation, which method includes 
5 providing a replicatable DMA sequence encoding a plurality of Fn3 p-strand 
domain sequences that are linked to a plurality of loop region sequences, wherein 
the nucleotide sequence of at least one loop region is known. Polymerase chain 
reaction (PGR) primers are provided or prepared which are sufficiently 
complementary to the known loop sequence so as to be hybridizable under PGR 
1 0 conditions, wherein at least one of the primers contains a modified nucleic acid 
sequence to be inserted into the DNA sequence. PGR is performed using the 
replicatable DNA sequence and the primers. The reaction product of the PGR is 
then expressed so as to yield a polypeptide monobody. 

The invention provides a further method of preparing a Fn3 polypeptide 
1 5 monobody wherein the Fn3 contains a stabilizing mutation. The method 

mcludes providing a rephcatable DNA sequence encoding a plurality of Fn3 P- 
strand domain sequences that are linked to a pluraUty of loop region sequences, 
wherein the nucleotide sequence of at least one loop region is known. Site- 
directed mutagenesis of at least one loop region is performed so as to create an 
20 insertion mutation. The resultant DNA including the insertion mutation is then 
expressed. 

Further provided is a variegated nucleic acid library encoding Fn3 
polypeptide monobodies including a plurality of nucleic acid species encoding a 
plurality of Fn3 p-strand domain sequences that are linked to a plurality of loop 

25 region sequences, wherein one or more of the monobody loop region sequences 
vary by deletion, insertion or replacement of at least two amino adds from 
corresponding loop region sequences in wild-type Fn3, and wherein the p-strand 
domains of the monobody have at least a 50% total amino acid sequence 
homology to the corresponding amino acid sequence of P-strand domain 

30 sequences of the wild-type Fn3, and wherein the Fn3 contains a stabiHzing 

mutation. The invention also provides a peptide display library derived from the 
variegated nucleic acid library of the invention. Preferably, the peptide of the 
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peptide display library is displayed on the sxirface of a bacteriophage, eg., a Ml 3 
bacteriophage or a fd bacteriophage, or virus. 

The invention also provides a method of identifying the amino acid 
sequence of a polypeptide molecule capable of binding to a specific binding 
5 partner (SBP) so as to form a polypeptide: SSP complex, v^herein the dissociation 
constant of the the polypeptide: SBP complex is less than 1 0*^ moles/liter. The 
method includes the steps of: 

a) providing a peptide display library of the invention; 

b) contacting the peptide display library of (a) with an immobilized 
10 or separable SBP; 

c) separating the peptide:SBP complexes from the free peptides; 

d) causing the replication of the separated peptides of (c) so as to 
result in a new peptide display library distinguished from tiiat in 
(a) by having a lowered diversity and by being enriched in 

1 5 displayed peptides capable of binding the SBP; 

e ) optionally repeating steps (b), (c), and (d) with the new library of 
(d); and 

f) determining tiie nucleic acid sequence of the region encoding the 
displayed peptide of a species from (d) and hence deducing the 

20 peptide sequence capable of binding to the SBP. 

The present invention also provides a method of preparing a variegated 
nucleic acid library encoding Fn3 polypeptide monobodies having a plurality of 
nucleic acid species each including a plurality of loop regions, wherein the 
species encode a plurality of Fn3 P-strand domain sequences that are linked to a 

25 plurality of loop region sequences, wherein one or more of the loop region 

sequences vary by deletion, insertion or replacement of at least two amino acids 
from corresponding loop region sequences in wild-type Fn3, and wherein the 
p-strand domain sequences of the monobody have at least a 50% total amino 
acid sequence homology to the corresponding amino acid sequences of p-strand 

30 domain sequences of the wild-type Fn3, and wherein the Fn3 contains a 
stabilizing mutation, including the steps of 

a) preparing an Fn3 polypeptide monobody having a predetermined 
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sequence; 

b) contacting the polypeptide with a specific binding partner (SBP) 
so as to form a polypeptide: SSP complex wherein the dissociation 
constant of the the polypeptide:SBP complex is less than 10'^ 

5 moles/liter; 

c) determining the binding stracture of the polypeptidetSBP 
complex by nuclear magnetic resonance spectroscopy or X-ray 
crystallography; and 

d) preparing the variegated nucleic acid library, wherein the 

[ 0 variegation is performed at positions in the nucleic acid sequence 

which, fi-om the information provided in (c), result in one or more 
polypeptides with improved binding to the SBP. 
Also provided is a method of identifying the amino acid sequence of a 
polypeptide molecule capable of catalyzing a chemical reaction with a catalyzed 
1 5 rate constant, k^,, and an uncatalyzed rate constant, k^^,,^, such that the ratio of 
kcat^l^ncat gTcatcr than 10. The method mcludes the steps of: 

a) providing a peptide display library of the invention; 

b) contacting the peptide display library of (a) with an immobilized 
or separable transition state analog compound (TSAC) 

20 representing the approximate molecular transition state of the 

chemical reaction; 

c) separating the peptide:TSAC complexes from the free peptides; 

d) causing the replication of the separated peptides of (c) so as to 
result in a new peptide display library distinguished from that in 

25 (a) by having a lowered diversity and by being enriched in 

displayed peptides capable of binding the TSAC; 

e) optionally repeating steps (b), (c), and (d) with the new library of 
(d); and 

f) determining the nucleic acid sequence of the region encoding the 
30 displayed peptide of a species from (d) and hence deducing the 

peptide sequence. 

The invention also provides a method of preparing a variegated nucleic 
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acid library encoding Fn3 polypeptide monobodies having a plurality of nucleic 
acid species each including a plurality of loop regions, wherein the species 
encode a plurality of Fn3 p-strand domain sequences that are linked to a plurality 
of loop region sequences, wherein one or more of the loop region sequences vary 
5 by deletion, insertion or replacement of at least two amino acids from 

corresponding loop region sequences in wild-type Fn3, and wherein the P-strand 
domain sequences of the monobody have at least a 50% total amino acid 
sequence homology to the corresponding amino acid sequences of p-strand 
domain sequences of the wild-type Fn3, and wherein the Fn3 contains a 
1 0 stabilizing mutation, including the steps of 

a) preparing an Fn3 polypeptide monobody having a predetermined 
sequence, wherein the polypeptide is capable of catalyzing a 
chemical reaction with a catalyzed rate constant, k^at, and an 
uncatalyzed rate constant, k^^gj, such that the ratio of k^t/k^^3j is 

15 greater than 10; 

b) contacting the polypeptide with an immobiHzed or separable 
transition state analog compound (TSAC) representing the 
approximate molecular transition state of the chemical reaction; 

c) determining the binding structure of the polypeptide:TSAC 

20 complex by nuclear magnetic resonance spectroscopy or X-ray 

crystallography; and 

d) preparing the variegated nucleic acid library, wherein the 
variegation is perfomied at positions in the nucleic acid sequence 
which, from the information provided in (c), result in one or more 

25 polypeptides with improved binding to or stabilization of the 

TSAC. 

The invention also provides a kit for the performance of any of the 
methods of the invention. The invention further provides a composition, e.g., a 
polypeptide, prepared by the use of the kit, or identified by any of the methods of 
30 the invention. 

The following abbreviations have been used in describing amino acids, 
peptides, or proteins: Ala or A, Alanine; Arg or R, Arginine; Asn or N 
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asparagine; Asp or D, aspartic add; Cys or C, cysteine; Gin or Q, glutamine; Glu 
or E, glutamic acid; Gly or G, glycine; His or H, histidine; He or I, isoleucine; 
Leu or L, leucine; Lys or K, lysine; Met or M, metbionine; Phe or F, 
phenylalanine; Pro or P, proline; Ser or S, serine; Thr or T, threonine; Trp or W, 
tryptophan; Tyr or Y, tyrosine; Val or V, valine. 

The following abbreviations have been used in describing nucleic acids, 
DNA, or RNA: A, adenosine; T, thymidine; G, guanosine; C, cytosine. 



10 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. p-Strand and loop topology (A, B) and MOLSCRIPT 
representation (C, D; Kraulis, 1991) of the VH domain of anti-lysozyme 
immunoglobulin D1.3 (A, C; Bhat et al, 1994) and 10th type III domain of 
human fibronectin (B, D; Main et al, 1992). The locations of complementarity 
15 determining regions (CDRs, hypervariable regions) and the integrin-binding 
Arg-Gly-Asp (RGD) sequence are indicated. 

Figure 2. Amino acid sequence (SEQ ID N0:1 10) and restriction sites 
of the synthetic Fn3 gene. The residue numbering is according to Main et al. 
(1992). Restriction enzyme sites designed are shown above the amino acid 
20 sequence. P-Strands are denoted by underlines. The N-teraiinal "mq" sequence 
has been added for a subsequent cloning into an expression vector. The His-tag 
(Novagen) fusion protein has an additional sequence, 
MGSSHHHHHHSSGLVPRGSH (SEQ ID N0:1 14), preceding the Fn3 

sequence shown above. 
25 Figure 3. A, Far UV CD spectra of wild-type Fn3 at 25°C and 90°C. 

Fn3 (50 liM) was dissolved in sodium acetate (50 mM, pH 4.6). B, thermal 
denaturation of Fn3 monitored at 215 nm. Temperatare was increased at arate 
of l°C/min. 

Figure 4. A, Ca trace of the crystal structure of the complex of lysozyme 
30 (HEL) and the Fv fragment of the anti-hen egg-white lysozyme (anti-HEL) 
antibody Dl .3 (Bhat et al, 1994). Side chains of the residues 99-102 of VH 
CDR3 which make contact with HEL, are also shown. B, Contact surface area 
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for each residue of the D1.3 VH-HEL and VH-VL interactions plotted vs. 
residue number of DL3 VH. Surface area and secondary structure were 
determined using the program DSSP (Kabsh and Sander, 1983). C and D, 
schematic drawings of the P-sheet structure of the F strand-loop-G strand 
5 moieties of Dl .3 VH (C) and Fn3 (D). The boxes denote residues in p-strands 
and ovals those not in strands. The shaded boxes indicate residues of which side 
chains are significantly buried. The broken lines indicate hydrogen bonds. 

Figure 5. Designed Fn3 gene showing DNA (SEQ ID NO:l 1 1) and 
amino acid (SEQ ID N0:1 12) sequences. The amino acid numbering is 
1 0 according to Main et al (1 992). The two loops that were randomized in 
combinatorial libraries are enclosed in boxes. 

Figure 6. Map of plasmid pAS45. Plasmid pAS45 is the expression 
vector of His*tag-Fn3. 

Figure 7. Map of plasmid pAS25. Plasmid pAS25 is the expression 
15 vector of Fn3. 

Figures. Map of plasmid pAS38. pAS38 is aphagmid vector for the 
surface display of Fn3 . 

Figure 9. (Ubiquitin-1) Characterization of ligand-specific binding of 
enriched clones using phage enzyme-linked immunosolvent assay (ELISA). 

20 Microtiter plate wells were coated with ubiquitin (1 jig/well; "Ligand (+)) and 
then blocked with BSA. Phage solution in TBS containing approximately 10'° 
colony forming units (cfu) was added to a well and washed with TBS. Bound 
phages were detected with anti-phage antibody-POD conjugate (Pharmacia) with 
TurbO"TMB (Pierce) as a substrate. Absorbance was measured using a 

25 Molecular Devices SPECTRAmax 250 microplate spectrophotometer. For a 
control, wells without the immobilized Hgand were used. 2-1 and 2-2 denote 
enriched clones firom Library 2 eluted with free ligand and acid, respectively. 4- 
1 and 4-2 denote enriched clones from Library 4 eluted with free ligand and acid, 
respectively. 

30 Figure 10. (Ubiquitin-2) Competition phage ELISA of enriched clones. 

Phage solutions containing approximately 10*** cfii were first incubated with free 
ubiquitin at 4°C for 1 hour prior to the binding to a ligand-coated well. The 
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wells were washed and phages detected as described above. 

Figure 11. Competition phage ELISA of ubiquitin-binding monobody 
41 L Experimental conditions are the same as described above for ubiquitin. 
The ELISA was performed in the presence of free ubiquitin in the binding 
5 solution. The experiments were performed with fonr different preparations of 
the same clone. 

Figure 12. (Fluorescein- 1) Phage ELISA of four clones, Plb25.1 
(containing SEQ ID NO: 1 1 5), Plb25.4 (containing SEQ ID NO: 1 1 6), pLB24, 1 
(containing SEQ ID NO:117) and pLB24.3 (containing SEQ ID NO:l 18). 
1 0 Experimental conditions are the same as ubiquitin-1 above. 

Figure 13. (Fluorescein-2) Competition ELISA of the four clones. 
Experimental conditions are the same as ubiquitin-2 above. 

Figure 14, ^H, ^^N-HSQC spectrum of a fluorescence-binding 
monobody LB25.5. Approximately 20 \iM protein was dissolved in 10 mM 
1 5 sodium acetate buffer (pH 5.0) containing 100 mM sodium chloride. The 
spectrum was collected at 30^C on a Varian Unity INOVA 600 NMR 
spectrometer. 

Figure 15. Characterization of the binding reaction of Ubi4-Fn3 to the 
target, ubiquitin. (a) Phage ELISA analysis of binding of Ubi4-Fn3 to ubiquitin. 
20 The binding of Ubi4-phages to ubiquitin-coated wells was measured. The 
control experiment was performed with wells containing no ubiquitin. 

(b) Competition phage ELISA of Ubi4-Fn3. Ubi4-Fn3-phages were 
preincubated with soluble ubiquitin at an indicated concentration, followed by 
the phage ELISA detection in ubiquitin-coated wells. 
25 (c) Competition phage ELISA testing the specificity of the Ubi4 clone. 

The Ubi4 phages were preincubated with 250 ^g/ml of soluble proteins, 
followed by phage ELISA as in (b). 

(d) ELISA using free proteins. 

Figure 16. Equilibrium unfolding curves forUbi4-Fn3 (closed symbols) 
30 'and wild-type Fn3 (open symbols). Squares indicate data measured in TBS (Tris 
HCl buffer (50 mM, pH 7.5) containing NaCl (150 mM)). Circles indicate data 
measured in Gly HCl buffer (20 mM, pH 3.3) containing NaCl (300 mM). The 
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curves show the best fit of the transition curve based on the two-state model. 
Parameters characterizing the transitions are listed in Table 8. 

Figure 17. (a) ^H, ^^N-HSQC spectrum of ['^N]-Ubi4-K Fn3. 
(b). Difference (6^;,^.^ - 6^^^^) of (b) and (c) chemical shifts plotted 
5 versus residue number. Values for residues 82-84 (shovm as filled circles) 
where Ubi4-K deletions are set to zero. Open circles indicate residues that are 
mutated in the Ubi4-K protein. The locations of p-strands are indicated with 
arrows. 

Figure 18. (A) Guanidine hydrochloride (GuHCl)-induced denaturation 

10 of FNfiilO monitored by Trp fluorescence. The fluorescence emission intensity 
at 355 nm is shown as a function of GuHCl concentration. The lines show the 
best fits of the data to the two-state transition model. (B) StabiUty of FN3 at 4 M 
GuHCl plotted as a function of pH. (C) pH dependence of the m value. 

Figure 19. A two-dimensional H(C)CO spectrum of FNfiil 0 showing 

15 the '^C chemical shift of the carboxyl carbon (vertical axis) and the *H shift of 
'H^ of Asp or ^H^ of Glu, respectively (horizontal axis). Cross peaks are labeled 
with their respective residue numbers. 

Figure 20. pH-Dependent shifts of the '^C chemical shifts of the 
carboxyl carbons of Asp and Glu residues in FNfiilO. Panel A shows data for 

20 Asp 3, 67 and 80, and Glu 38 and 47. The lines are the best fits of the data to the 
Henderson-Hasselbalch equation with one ionizable group (Mcintosh, L. P., 
Hand, G., Johnson, P. E., Joshi, M. D., Koemer, M., Plesniak, L. A., Ziser, L., 
Wakarchuk, W. W. & Withers, S. G. (1996) Biochemistiy 55, 9958-9966). 
Panel B shows data for Asp 7 and 23 and Glu 9. The contmuous lines show the 

25 best fits to the Henderson-Hasselbalch equation with two ionizable groups, while 
the dashed lines show the best fits to the equation with a single ionizable group. 

Figure 21. (A) The amino acid sequence of FNfiilO (SEQ ID NO: 121) 
shown according to its topology (Main, A. L., Harvey, T. S,, Baron, M., Boyd, J„ 
& Campbell, L D. (1992) Cell 71, 671-678). Asp and Glu residues are 

30 highlighted with gray circles. The thin lines and arrows connecting circles 
indicate backbone hydrogen bonds. (B) A CPK model of FN3 showing the 
locations of Asp 7 and 23 and Glu 9. 
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Fi<nire 22. Thermal denaturation of the wild-type and mutant FNfiil 0 

proteins at pH 7.0 and 2.4 in the presence of 6.3 M urea and 0.1 or 1.0 M NaCl. 

Change in circular dichroism signal at 227 nm is plotted as a function of 

temperature. The filled circles show the data in the presence of 1 M NaCl and 
5 the open circles are data in the presence of 0. 1 M NaCl. The left column shows 

data taken at pH 2.4 and the right column at pH 7.0. The identity of proteins is 

indicated in the panels. 

Figure 23. GuHCl-induce denaturation of FNfiilO mutants monitored 

with fluorescence. Fluorescence data was converted to the fraction of unfolded 
1 0 protein according to the two-state transition model (Loladze, V. V., Ibarra- 

Molero, B., Sanchez-Ruiz, J. M. & Makhatadze, G. I. (1999) Biochemistry 38, 

16419-16423), and plotted as a fimction of GuHCl. 

Figure 24. pH Titration of the carboxyl '^C resonance of Asp and Glu 

residues in D7N (open circles) and D7K (closed circles) FNfiilO. Data for the 
1 5 wild-type (crosses) are also shown for comparison. Residue names are denoted 

in the individual panels. 

DETAILED DESCRIPTION OF THE INVENTION 

20 For the past decade the immune system has been exploited as a rich 

source ofde novo catalysts. Catalytic antibodies have been shown to have 
chemoselectivity, enantioselectivity, large rate accelerations, and even an ability 
to reroute chemical reactions. In most cases the antibodies have been elicited to 
transition state analog (TSA) haptens. These TSA haptens are stable, low- 

25 molecular weight compounds designed to mimic the structures of the 

energetically unstable transition state species that briefly (approximate half-hfe 
10"'^ s) appear along reaction pathways between reactants and products. 
Anti-TSA antibodies, like nattaral enzymes, are thought to selectively bind and 
stabilize transition state, thereby easing the passage of reactants to products. 

30 Thus, upon bmding, the antibody lowers the energy of the actual transition state 
and increases the rate of the reaction. These catalysts can be programmed to 
bind to geometrical and electrostatic features of the transition state so that the 
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reaction route can be controlled by neutralizing unfavorable charges, overcoming 
entropic barriers, and dictating stereoelectronic features of the reaction. By this 
means even reactions that are otherwise highly disfavored have been catalyzed 
(Janda et al 1997). Further, in many instances catalysts have been made for 
5 reactions for which there are no known natural or man-made enzymes. 

The success of any combinatorial chemical system in obtaining a 
particular function depends on the size of the library and the ability to access its 
members. Most often the antibodies that are made in an animal against a hapten 
that mimics the transition state of a reaction are first screened for binding to the 

10 hapten and then screened again for catalytic activity. An improved method 
allows for the direct selection for catalysis firom antibody libraries in phage, 
thereby linking chemistry and replication. 

A library of antibody fragments can be created on the surface of 
filamentous phage viruses by adding randomized antibody genes to the gene that 

1 5 encodes the phage's coat protein. Each phage then expresses and displays 

multiple copies of a single antibody fragment on its surface. Because each phage 
possesses both the surface-displayed antibody fragment and the DNA that 
encodes that fragment, and antibody fragment that binds to a target can be 
identified by amplifying the associated DNA. 

20 Immunochemists use as antigens materials that have as little chemical 

reactivity as possible. It is almost always the case that one wishes the ultimate 
antibody to interact with native structures. In reactive immunization the concept 
is just the opposite. One immunizes with compounds that are highly reactive so 
that upon binding to the antibody molecule during the induction process, a 

25 chemical reaction ensues. Later this same chemical reaction becomes part of the 
mechanism of the catal)4ic event. In a certain sense one is immunizing with a 
chemical reaction rather than a substance per se. Reactive immunogens can be 
considered as analogous to the mechanism-based inhibitors that enzymologists 
use except that they are used in the inverse way in that, instead of inhibiting a 

30 mechanism, they induce a mechanism. 

Man-made catalytic antibodies have considerable commercial potential in 
many different applications. Catalytic antibody-based products have been used 
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successfolly in prototype experiments in therapeutic applications, such as 
prodrug activation and cocaine inactivation, and in nontherapeutic applications, 
such as biosensors and organic synthesis. 

Catalytic antibodies are theoretically more attractive than noncatalytic 
antibodies as therapeutic agents because, being catalytic, they may be used in 
lower doses, and also because their effects are unusually irreversible (for 
iple, peptide bond cleavage rather than binding). In therapy, purified 
atalytic antibodies could be directly administered to a patient, or alternatively 
the patient's own catalytic antibody response could be elicited by immunization 
10 with an appropriate hapten. Catalytic antibodies also could be used as clinical 
diagnostic tools or as regioselective or stereoselective catalysts in the synthesis 
of fine chemicals. 

I. Mutation of F n^ loo ps and praf t ing of Ab loops onto Fn3 

1 5 An ideal scaffold for CDR grafting is highly soluble and stable. It is 

small enough for structural analysis, yet large enough to accommodate multiple 
CDRs so as to achieve tight binding and/or high specificity. 

A novel strategy to generate an artificial Ab system on the firamework of 
an existing non-Ab protein was developed. An advantage of this approach over 
20 the minimization of an Ab scaffold is that one can avoid inheriting the undesired 
properties of Abs. Fibronectin type ffl domain (Fn3) was used as the scaffold. 
Fibronectin is a large protein which plays essential roles in the formation of 
extracellular matrix and cell-cell interactions; it consists of many repeats of three 
types (I, n and III) of small domains (Baronera/., 1991). Fn3 itself is the 
25 paradigm of a large subfamily (Fn3 family or s-type Ig family) of the 

immunoglobulin superfamily (IgSF). The Fn3 family includes cell adhesion 
molecules, cell surface hormone and cytokine receptors, chaperonins, and 
carbohydrate-binding domains (for reviews, see Bork & Doolittle, 1992; Jones, 
1993; Bork et al, 1994; Campbell & Spitzfaden, 1994; Harpez & Chothia, 
30 1994). 

Recently, crystallographic studies revealed that the structure of the DNA 
binding domains of the transcription factor NF-kB is also closely related to the 
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FnS fold (Ghosh et al, 1995; Miiller et al, 1995). These proteins are all 
involved in specific molecular recognition, and inmost cases ligand-binding 
sites are formed by surface loops, suggesting that the Fn3 scaffold is an excellent 
framework for building specific binding proteins. The 3D structure of Fn3 has 
5 been determined by NMR (Main et al , 1 992) and by X-ray crystallography 
(Leahy et al, 1992; Dickinson et al, 1994). The structure is best described as a 
p-sandwich similar to that of Ab VH domain except that FnS has seven P-strands 
instead of nine (Fig. 1). There are three loops on each end of Fn3; the positions 
of the BC, DE and FG loops approximately correspond to those of CDRl, 2 and 

10 3 of the VH domain, respectively (Fig. 1 C, D), 

Fn3 is small (- 95 residues), monomeric, soluble and stable. It is one of 
few members of IgSF that do not have disulfide bonds; VH has an interstrand 
disulfide bond (Fig. 1 A) and has marginal stability under reducing conditions. 
FnS has been expressed in E. coli (Aukhil et al, 1993). In addition, 17 Fn3 

1 5 domains are present just in human fibronectia, providing important information 
on conserved residues which are often important for the stability and folding (for 
sequence alignment, see Main et al, 1992 and Dickinson et al, 1994). From 
sequence analysis, large variations are seen in the BC and FG loops, suggesting 
. that the loops are not crucial to stability. NMR studies have revealed that the FG 

20 loop is highly flexible; the flexibility has been implicated for the specific binding 
of the 1 0th Fn3 to ajP, integrin through the Arg-Gly-Asp (RGD) motif. In the 
crystal structure of human growth hormone-receptor complex (de Vos et al, 
1992), the second Fn3 domain of the receptor interacts v/ifh hormone via the FG 
and BC loops, suggesting it is feasible to build a binding site using the two 

25 loops. 

The tenth type III module of fibronectin has a fold similar to that of 
immunoglobulin domains, with seven p strands forming two antiparallel p 
sheets, which pack against each other (Main et al, 1992), The structure of the 
type II module consists of seven p strands, which form a sandwich of two 
.30 antiparallel P sheets, one containing three strands (ABE) and the other four 

strands (C'CFG) (Williams et al, 1988). The triple-stranded p sheet consists of 
residues Glu-9-Thr-14 (A), Ser-17-Asp-23 (B), and Thr-56-Ser-60 (E). The 
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majority of the conserved residues contribute to the hydrophobic core, with the 
invariant hydrophobic residues Trp-22 and Try-68 lying toward theN-terminal 
and C-terminal ends of the core, respectively. The p strands are much less 
flexible and appear to provide a rigid framework upon which functional, flexible 
5 loops are built. The topology is similar to fliat of immunoglobulin C domains. 

dmp. ponstruction and mut agenesis 

A synthetic gene for tenth Fn3 of human fibronectin (Fig. 2) was 
designed which includes convenient restriction sites for ease of mutagenesis and 
10 uses specific codons for high-level protein expression (Gribskov et al, 1984). 

The gene was assembled as follows: (1) the gene sequence was divided 
into five parts with boundaries at designed restriction sites (Fig.2); (2) for each 
part, a pair of oligonucleotides that code opposite strands and have 
complementary overlaps of ~ 15 bases was synthesized; (3) the two 
1 5 oligonucleotides were annealed and single strand regions were filled in using the 
Klenow fragment of DNA polymerase; (4) the double-sfranded oligonucleotide 
was cloned into the pETSa vector (Novagen) using restriction enzyme sites at the 
termini of the fragment and its sequence was confirmed by an AppUed 
Biosystems DNA sequencer using the dideoxy termination protocol provided by 
20 the manufacturer; (5) steps 2-4 were repeated to obtain the whole gene (plasmid 
pAS25) (Fig. 7). 

Although the present method takes more time to assemble a gene than the 
one-step polymerase chain reaction (PGR) method (Sandhu et al, 1992), no 
mutations occurred in the gene. Mutations would likely have been introduced by 

25 the low fidelity replication by Taq polymerase and would have required time- 
consuming gene editing. The gene was also cloned into the pET15b (Novagen) 
vector (pEWl). Both vectors expressed the Fn3 gene under the control of 
bacteriophage T7 promoter (Studler et al. 1990); pAS25 expressed the 96- 
residue Fn3 protein only, while pEWl expressed Fn3 as a fiasion protein with 

30 poly-histidine peptide (His-tag). Recombinant DNA manipulations were 
performed according to Molecular Cloning (Sambrook et al, 1989), unless 
otherwise stated. 
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Mutations were introduced to the Fn3 gene using either cassette 
mutagenesis or oligonucleotide site-directed mutagenesis techniques (Deng & 
Nickoloff, 1992). Cassette mutagenesis was performed using the same protocol 
for gene construction described above; double-stranded DNA fragment coding a 
5 new sequence was cloned into an expression vector (pAS25 and/or pEWl ). 
Many mutations can be made by combining a newly synthesized strand (coding 
mutations) and an oligonucleotide used for the gene synthesis. The resulting 
genes were sequenced to confirm that the designed mutations and no other 
mutations were introduced by mutagenesis reactions. 

10 

Design and synthesis of Fn3 mutants with antibody CDRs 

Two candidate loops (FG and BC) were identified for grafting. 
Antibodies with known crystal structures were examined in order to identify 
candidates for the sources of loops to be grafted onto Fn3. Anti-hen egg 

1 5 lysozyme (HEL) antibody D 1 .3 (Bhat et al , 1 994) was chosen as the source of a 
CDR loop. The reasons for this choice were: (1) high resolution crystal 
structures of the free and complexed states are available (Fig. 4 A; Bhat et al, 
1994), (2) thermodynamics data for the binding reaction are available (Tello et 
al, 1993), (3) Dl .3 has been used as a paradigm for Ab structural analysis and 

20 Ab engineering (Verhoeyen et al, 1988; McCafferty et al, 1990) (4) site- 
directed mutagenesis experiments have shown that CDR3 of the heavy chain 
(VH-CDR3) makes a larger contribution to the affinity than the other CDRs 
(Hawkins et al, 1993), and (5) a binding assay can be easily performed. The 
objective for this trial was to graft VH-CDR3 of D1.3 onto the Fn3 scaffold 

25 without significant loss of stability. 

An analysis of the Dl .3 structure (Fig. 4) revealed that only residues 99- 
102 ("RDYR") (SEQ ID NO:120) make direct contact with hen egg-white 
lysozyme (HEL) (Fig. 4 B), although VH-CDR3 is defined as longer (Bhater al, 
1994). It should be noted that the C-terminal half of VH-CDR3 (residues 101- 

30 104) made significant contact with the VL domain (Fig. 4 B). It has also 
become clear that D1.3 VH-CDR3 (Fig. 4 C) has a shorter turn between the 
strands F and G than the FG loop of Fn3 (Fig. 4 D). Therefore, mutant 
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sequences were designed by using the RDYR (99-102) (SEQ ID NO:120) of 
D1.3 as tbe core and made different boundaries and loop lengths (Table 1). 
Shorter loops may mimic the Dl .3 CDR3 conformation better, thereby yielding 
higher affinity, but they may also significantly reduce stability by removing wiL 
5 type interactions of Fn3 . 
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Table 1. Amino acid sequences of D1.3 VH CDR3, VH8 CDR3 and Fn3 FG 
loop and list of planned mutants. 





96 100 105 

• • • 




D1.3 


ARERDYRLDYWGQG 


(SEQIDNOrl) 


VH8 


ARGAVVS YYAMD YWGQG 


(SEQIDNO:2) 




.75 80 85 




Fh3 


• • • 

YAVTGRGDSPASSKPI 


(SEQIDN0:3) 


Mutant 


Sequence 


Dl.3-1 


YAERD YRLDY PI 


(SEQIDNO:4) 


D 1.3-2 


YAVRDYRLDY PI 


(SEQIDNO:5) 


DL3-3 


YAVRDYRLDYASSKPI 


(SEQIDNO:6) 


D 1.3-4 


YAVRDYRLDY KPI 


(SEQ IDN0:7) 


Dl.3-5 


YAVRDYR SKPI 


(SEQIDN0:8) 


D 1.3-6 


YAVTRDYRL — SSKPI 


(SEQ IDNO:9) 


Dl.3-7 


YAVTERDYRL-SSKPI 


(SEQ ID NO: 10) 


VH8-1 


YAVAVVS YYAMD Y-PI 


(SEQ IDNOrll) 


VH8-2 


YAVTAVVSYYASSKPI 


(SEO ID NO: 12) 



Underlines indicate residues in P-strands. Bold 
characters indicate replaced residues. 

20 

In addition, an anti-HEL single VH domain termed VH8 (Ward et ai, 
1989) was chosen as a template. VH8 was selected by library screening and, in 
spite of the lack of the VL domain, VH8 has an affinity for HEL of 27 nM, 

25 probably due to its longer VH-CDR3 (Table 1). Therefore, its VH-CDR3 was 
grafted onto Fn3. Longer loops may be advantageous on the Fn3 framework 
because they may provide higher afBnity and also are close to the loop length of 
wild-type Fn3. The 3D structure of VH8 was not known and thus the VH8 
CDR3 sequence was aligned with that of D 1.3 VH-CDR3; two loops were 

30 designed (Table 1). 
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Mutant construction and pr oduction 

Site-directed mutagenesis experiments were performed to obtain 
designed sequences. Two mutant Fn3s,Dl. 3-1 and Dl. 3-4 (Table 1) were 
obtained and both were expressed as soluble His'tag fusion proteins. Dl.3-4 
5 was purified and the His-tag portion was removed by thrombin cleavage. Dl.3-4 
is soluble up to at least 1 mM at pH 7.2. No aggregation of the protein has been 
observed during sample preparation and NMR data acquisition. 

Prntein ex pressio n and purification 

1 0 E. coli BL21 (DE3) (Novagen) were transformed with an expression 

vector (pAS25, pEWl and their derivatives) containing a gene for the wild-type 
or a mutant. Cells were grown in M9 minimal medium and M9 medium 
supplemented v^th Bactotrypton (Difco) containing ampiciUin (200 ng/ml). For 
isotopic labeling, "^N NH^Cl and/or '^C glucose replaced unlabeled components. 
15 500 ml medium in a 2 liter baffle flask were inoculated with 10 ml of overnight 
culture and agitated at ITC. Isopropylthio-p-galactoside (IPTG) was added at a 
final concentration of 1 mM to initiate protein expression when OD (600 nm) 
reaches one. The cells were harvested by centrifugation 3 hours after the 
addition of IPTG and kept frozen at -70'C until used. 
20 Fn3 without His-tag was purified as follows. Cells were suspended in 

5 ml/(g cell) of Tris (50 mM, pH 7.6) containing ethylenediaminetetraacetic acid 
(EDTA; 1 mM) and phenyhnethylsulfonyl fluoride (1 mM). HEL was added to a 
final concentration of 0.5 mg/ml. After incubating the solution for 30 minutes at 
37-C, it was sonicated three times for 30 seconds on ice. Cell debris was 
25 removed by centrifiigation. Ammonium sulfate was added to the solution and 
precipitate recovered by centrifiigation. The pellet was dissolved in 5-10 ml 
sodium acetate (50 mM, pH 4.6) and insoluble material was removed by 
centrifiigation. The solution was applied to a Sephacryl SIOOHR column 
(Pharmacia) equilibrated in the sodium acetate buffer. Fractions containing Fn3 
30 then was appUed to a Resources column (Pharmacia) equilibrated in sodium 

acetate (50 mM, pH 4.6) and eluted with a linear gradient of sodium chloride (0- 
0.5 M). The protocol can be adjusted to purify mutant proteins with different 
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surface charge properties. 

Fn3 with His^tag was purified as follows. The soluble fraction was 
prepared as described above, except that sodium phosphate buffer (50 mM, pH 
7.6) containing sodium chloride (100 mM) replaced the Tris buffer. The 
5 solution was apphed to a Hi-Trap chelating colvinm (Pharmacia) preloaded with 
nickel and equilibrated in the phosphate buffer. After washing the column with 
the buffer, His»tag-Fn3 was eluted in the phosphate buffer containing 50 mM 
EDTA. Fractions containing His»tag-Fn3 were pooled and applied to a 
Sephacryl S 1 00-HR column, yielding highly pure protein. The His*tag portion 

10 was cleaved off by treating the fusion protein with thrombin using the protocol 
supplied by Novagen. FnS was separated from the His*tag peptide and thrombin 
by a Resources column using the protocol above. 

The wild-type and two mutant proteins so far examined are expressed as 
soluble proteins. In the case that a mutant is expressed as inclusion bodies 

1 5 (insoluble aggregate), it is first examined if it can be expressed as a soluble 

protein at lower temperature (e.g., 25-30" C). If this is not possible, the inclusion 
bodies are collected by low-speed centrifiigation following cell lysis as described 
above. The pellet is washed with buffer, sonicated and centrifiiged. The 
inclusion bodies are solubilized in phosphate buffer (50 mM, pH 7.6) containing 

20 guanidinium chloride (GdnCl, 6 M) and will be loaded on a Hi-Trap chelating 
column. The protein is eluted with the buffer containing GdnCl and 50 mM 
EDTA, 

Conformation of mutant Fn3. Dl.3-4 

25 The NMR spectra of His-tag Dl .3-4 fusion protein closely resembled 

that of the wild-type, suggesting the mutant is folded in a similar conformation to 
that of the wild-type. The spectrum of Dl .3-4 after the removal of the His-tag 
peptide showed a large spectral dispersion. A large dispersion of amide protons 
(7-9.5 ppm) and a large number of downfield (5.0-6.5 ppm) C" protons are 

30 characteristic of a (i-sheet protein (Wuthrich, 1986). 

The 2D NOESY spectrum of Dl.3-4 provided further evidence for a 
preserved confomiation. The region in the spectrum showed interactions 
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between .pfield methyl protons (< 0.5 ppm) and methyl-n^ethylene protons. The 
Val72 Y methyl resonances were well separated in the wild-type spectrum (-0.07 
and 0 37 ppm; (Baron a/., 1992)). Resonances corresponding to the two 
^e&yl protons are present in fte Dl .3-4 spectrum (-0.07 and 0.44 ppm). ^ 
5 crosspeakbetweenthesetworesonancesandotherconservedcrosspeaks 

indicate that the two resonances in theD1.3-4spect.^arehighly likely thoseof 

Val72 and that other methyl protons are in nearly identical environment to that of 
wild-type Fn3. Minor differences between the two spectra are presumablydue to 

sn^allstructuralperturbationduetothemutations. Val72 is on the F strand, 
10 whereitfonnsapartofthecentralhydrophobiccoreofFn3(Main..a/.,1992). 
It is only four residues away from fl^emutatedresiduesofAeFG loop (Tablel). 

The results are remarkable because, despite there being 7 mutations and 3 
deletions in the loop (more than 10% of total residues; Fig. 12, Table 2), Dl.3-4 
retains a 3D structure virtually identical to that of the wild-type (except for the 
15 mutatedloop). Therefore, the results provide strong support that tl.e FG loop rs 
not significantly contributing to the folding and stabilityoftheFnSmoleculeand 

thus that the FG loop can be mutated extensively. 



Table 2. Sequences of oUgonucleotides 
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Name Sequence - ^^.^^ 

CGGGATCCCAlATGCAGGTrrCTGATGTTCCGCGTGACC 

TGGAAGTrGTTGCTGCGACC (SEQ ID N0:13) 
TAACTGCAGGAGCATCCCAGCTGATCAGCAGGCTAGTC 

GGGGTCGCAGCAACAAC (SEQ ID N0:14) 
CTCCTSCAGTTACCGTGCGTTATTACCGTATCACGTACG 

GTGAAACCGGTG (SEQ ID N0:15) 

GTGAATTQCTGAACCGGGGAGTTACCACCGGTrrCACC 
G(SEQIDN0:16) 

AGGMTTCACTGTACCTGGTTCCAAGTCTACTGCTACCA 
TCAGCGG (SEQ ID NO: 17) 

GTATAGICGACACCCGGTTTCAGGCCGCTGATGGTAGC 



FNIF 
FNIR 

25 

FN2F 
FN2R 
30 FN3F 



FN3R 
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FN5R 



10 FN5R' 



15 



BC3 



20 FG2 



FG3 
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(SEQIDN0:18) 

FN4F CGGGTGIQGACTATACCATCACTGTATACGCT (SEQ ID 

NO: 19) 

FN4R CGGGATCCGAGCTCGCTGGGCTGTCACCACGGCCAGTA 

ACAGCGTATACAGTGAT (SEQ ID NO:20) 
FN5F CAGCGAGCTCCAAGCCAATCTCGATTAACTACCGT (SEQ 

IDN0:21) 

CGGGATCCTCGAGTTACTAGGTACGGTAGTTAATCGA 
(SEQIDNO:22) 

CGGGATCCACGCGTGCCACCGGTACGGTAGTTAATCGA 
(SEQIDNO:23) 

CGGGAICQACGCGTCCATTCGTTTGTGAATATCAAGGCC 
AATCG (SEQ ID NO:24) 

CCGGAAGCTTTAAGACTCCTTATTACGCAGTATGTTAGC 
(SEQIDNO:25) 

38TAABglII CTGTTACTGGCCGTGAGATCTAACCAGCGAGCTCCA 
(SEQIDNO:26) 

GATCAGCTGGGATGCTCCTNNKI^NKNNKNNK]^ 
ACCGTATCACGTA (SEQ ID NO:27) 
TGTATACGCTGTTACTGGCNNKNNK]W]amK:NN^^ 
NKTCCAAGCCAATCJCGAT (SEQ ID NO:28) 
CTGTATACGCTGTTACTGGCN>nQWKNNKNNKCCAGCG 
AGCTCCAAG (SEQ ID NO:29) 

CATCACTGTATACGCTGTTACTNNKNNKNNKNNKNNKT 
CCAAGCCAATCTC (SEQ ID NO:30) 



gene3F 



gene3R 



FG4 



Restriction enzyme sites are underlined. N and K denote an equimolar mixture 
of A, T. G and C and that of G and T, respectively. 



30 Structure and stability measurements 

Structures of Abs were analyzed using quantitative methods {e.g., DSSP 
(Kabsch & Sander, 1983) and PDBfit (D. McRee, The Scripps Research 
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tostitute)) as «ell as computer graphics (eg., Q^ta (Molecular SimulaUons) 
and What if (G. Vriend, European Molecular Biology Laboratory)) to 
superirrrposethe strand-loop-strand structures of Abs and Fn3. 

n>. stability otmonobodies was detemm.ed by measuring temperature- 
5 andohemicaldenatutant-inducedunfoldingreactiDns(Pace«<:/.,1989). The 

temperature-induced unfolding reaction was measured usfag a circular dichrotsm 
(CD) polarimeter. EUipticity at 222 and 215 tm. was recorded as the sample 
temperature was slowly raised. Sample concentrations between 10 and 50 ,rM 
were used. After the unfolding baseline was established, fte tempe^nrre was 
10 loweredto examine ftereversibility of the unfoldingteaction. Freeenergyof 

unfolding was determined by fitting data to the equation for the two-state 
transition (Beclctel&Schelhnan.l987;Pacee,d., 1989). Nonlinear least- 

scuaxesfittingwas performed usmg the propam Igor (WaveMetrics)ona 

Macintosh computer. , .t, 

15 The stmcture and stability of two selected mutant Fn3s were studred; the 

first mutant was Dl.3-4 (Table 2) and the second was a mutant called AS40 
wbiehcontainsfourmutationsin.heBCloop(A»VW)-TQRQ).AS40 

was randomly chosen from the BC loop Ubrary described above. Bo* mutants 
were expressed as soluhle proteins in £. coK and were concentrate atleastto 

20 mM, permitting NMR studies. 

The mid-point of die thennal denamration for both mutants was 
appn>ximately 69-C, as compared to approximately 79-C for to wild-type 
protein Tire results indicated that the extensive mutations a. the two surface 
loops did no. drastically debase fte stability of Fn3, and ftus demonstrated the 
25 feasibilityofintroducingalargenumherofmutationsinbothloops. 

Stability was also determined by guanidmium chloride (GdnClV and 
„ea-induoed unfolding reactions. Prelhninary unfolding curves were recorded 
using a fluorometer equipped with a motor-ddven syringe; GdnCl or urea were 
added continuously to the protein solution in the cuvette. Based on the 
30 preliminary unfolding curves, separate samples containmg varying concentrahon 
of a denaturant were prepared and fluorescence (excitadon a. 290 mn. emisston 
at 300-400 nm) or CD (effipticity a, 222 and 21 5 nm) were measured after dre 
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samples were equilibrated at the measurement temperature for at least one hour. 
The curve was fitted by the least-squares method to the equation for the two- 
state model (Santoro & Bolen, 1988; Koide et al, 1993). The change in protein 
concentration was compensated if required. 
5 Once the reversibiUty of the themial unfolding reaction is established, the 

xmfolding reaction is measured by a Microcal MC-2 differential scanning 
calorimeter (DSC). The cell 1.3 ml) will be filled with FnAb solution (0.1 - 
1 mM) and ACp (= AH/AT) will be recorded as the temperature is slowly raised. 
T^ (the midpoint of unfolding), AH of unfolding and AG of unfolding is 
1 0 determined by fitting the transition curve (Privalov & Potekhin, 1 986) with the 
Origin software provided by Microcal. 

Thermal unfolding 

A temperature-induced unfolding experiment on Fn3 was performed 
15 using circular dichroism (CD) spectroscopy to monitor changes in secondary 
structure. The CD spectrum of the native Fn3 shows a weak signal near 222 nm 
(Fig. 3A), consistent with the predominantly p-structure of Fn3 (Perczel et al, 
1 992). A cooperative unfolding transition is observed at 80-90" C, clearly 
indicating high stability of Fn3 (Fig. 3B). The fi-ee energy of unfolding could not 
20 be determined due to the lack of a post-transition baseline. The result is 

consistent with the high stability of the first FnS domain of human fibronectin 
(Litvinovich et al, 1992), thus indicating that Fn3 domains are in general highly 
stable. 

25 Binding assays 

The binding reactions of monobodies were characterized quantitatively 
using an isothermal titration calorimeter (ITC) and fluorescence spectroscopy. 

The enthalpy change (AH) of binding were measured using a Microcal 
Omega ITC (Wiseman era/., 1989). The sample cell (- 1.3 ml) was filled with 
30 Monobody solution (< 1 00 p.M, changed according to K^, and the reference cell 
filled with distilled water; the system was equilibrated at a given temperature 
until a stable baseline is obtained; 5-20 ^1 of Ugand solution {<. 2 mM) was 
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injected by a motor-driven syringe within a short duration (20 sec) followed by 
an equilibration delay (4 minutes); the injection was repeated and heat 
generation/absorption for each injection was measured. From the change in the 
observed heat change as a function of ligand concentration, AH and was 
5 detemuned (Wiseman et al, 1989). AG and AS of the binding reaction was 
deduced from the two directly measured parameters. Deviation from the 
theoretical curve was examined to assess nonspecific (multiple-site) binding. 
Experiments were also be performed by placing a Hgand in the cell and titrating 
with an FnAb. It should be emphasized that only ITC gives direct measurement 
1 0 of AH, thereby making it possible to evaluate enthalpic and entropic 

contributions to the binding energy. ITC was successfully used to monitor the 
binding reaction of the D1.3 Ab (Tello et al, 1993; Bhat et al., 1994). 

hitrinsic fluorescence is monitored to measure binding reactions with 
in the sub-|iM range where the determination of by ITC is difticult. Tip 
15 fluorescence (excitation at - 290 nm, emission at 300-350 ran) and Tyr 

fluorescence (excitation at ~ 260 ran, emission at ~ 303 nm) is monitored as the 
Fn3-mutant solution (< 10 \iM) is tifrated with Ugand solution (^ 100 [iM). 
of the reaction is determined by the nonlinear least-squares fitting of the 
bimolecular binding equation. Presence of secondary binding sites is examined 
20 using Scatchard analysis, hi all binding assays, control experiments are 
perforaied busing wild-type Fn3 (or unrelated monobodies) in place of 
monobodies of interest. 

H. Production of Fnl mutants with high affinity and specificity 

25 Monobodies 

Library screening was carried out in order to select monobodies that bind 
to specific ligands. This is complementary to the modeling approach described 
above. The advantage of combinatorial screening is that one can easily produce 
and screen a large number of variants (. 1 0"), which is not feasible with specific 

30 mutagenesis ("rational design") approaches. The phage display technique 

(Smith, 1985; O'Neil & Hoess, 1995) was used to effect the screening processes. 
Fn3 was fiised to a phage coat protein (pIII) and displayed on the surface of 
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filamentous phages. These phages harbor a single-stranded DNA genome that 
contains the gene coding the Fn3 fusion protein. The amino acid sequence of 
defined regions of Fn3 were randomized using a degenerate nucleotide sequence, 
thereby constructing a Kbrary. Phages displaying Fn3 mutants with desired 
5 binding capabilities were selected in vitro, recovered and ampUfied. The amino 
acid sequence of a selected clone can be identified readily by sequencing the Fn3 
gene of the selected phage. The protocols of Smith (Smith & Scott, 1993) were 
followed with minor modifications . 

The objective was to produce Monobodies which have high affinity to 

10 small protein ligands. HEL and the Bl domain of staphylococcal protein G 
(hereafter referred to as protein G) were used as ligands. Protein G is small 
(56 amino acids) and highly stable (Minor & Kim, 1994; Smith et al, 1994). Its 
structure was determined by NMR spectroscopy (Gronenbom et a/., 1991) to be 
a helix packed against a four-strand P-sheet. The resulting FnAb-protein G 

15 complexes (-150 residues) is one of the smallest protein-protein complexes 
produced to date, well within the range of direct NMR methods. The small size, 
the high stability and solubility of both components and the ability to label each 
with stable isotopes ('^C and *^N; see below for protein G) make the complexes 
an ideal model system for NMR studies on protein-protein interactions. 

20 The successful loop replacement of Fn3 (the mutant Dl .3-4) demonstrate 

that at least ten residues can be mutated without the loss of the global fold. 
Based on this, a library was first constructed in which only residues in the FG 
loop are randomized. After results of loop replacement experiments on the BC 
loop were obtained, mutation sites were extended that include the BC'loop and 

25 other sites. 

Construction of Fn3 phage display system 

An Ml 3 phage-based expression vector p ASM 1 has been constructed as 
follows: an oligonucleotide coding the signal peptide of OmpT was cloned at 
30 the 5' end of the Fn3 gene; a gene fragment coding the C-terminal domain of 
Ml 3 pIII was prepared from the wild-type gene III gene of M13 mpl 8 using 
PGR (Corey et al, 1993) and the fragment was inserted at the 3' end of the 
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OmpT-Fn3 gene; a spacer sequence has been inserted between Fn3 and pIII. The 
resultant fragment (OmpT-Fn3-pni) was cloned in the multiple cloning site of 
M13 mpl8, where the fusion gene is under the control of the lac promoter. Tins 
system will produce the Fn3-pm fusion protein as well as the wild-type pHI 
5 protein. TT.e co-expression of wild-type pDI is expected to reduce the number of 
fusion pin protein, thereby increasing the phage infectivity (Corey a/ al, 1993) 
(five copies of pIII are present on a phage particle), hi addition, a smaller 
number of fusion pIH protein may be advantageous in selecting tight bindmg 
proteins, because the chelating effect due to multiple binding sites should be 
10 smaller than that with all five copies effusion pill (Bass et ai, 1990). Tins 

system has successfully displayed the serine protease trypsin (Corey al, 1993). 
Phages were produced and purified using E. coli K91kan (Smith & Scott, 1993) 
according to a standard method (Sambrook et al, 1989) except that phage 
particles were purified by a second polyethylene glycol precipitation and acid 

15 precipitation. 

Successfiil display of Fn3 on fusion phages has been confirmed by 
ELISA using an Ab against fibronectin (Sigma), clearly indicating that it is 
feasible to construct libraries using this system. 

An alternative system using the fUSE5 (Parmley & Smith, 1988) may 
20 also be used. The Fn3 gene is inserted to fUSE5 using the Sfil restriction sites 
introduced at the 5'- and 3'- ends of the Fn3 gene PGR. Hiis system displays 
onlythefiasionpniprotein(uptofivecopies)onthesurfaceofaphage. Phages 

are produced and purified as described (Smith & Scott, 1993). "ms system has 
been used to display many proteins and is robust. The advantage of fUSE5 is its 
25 low toxicity. This is due to the low copy number of the replication foma (RF) in 
the host, which in turn makes it difficult to prepare a sufficient amount of RP for 
library construction (Smith & Scott, 1993). 
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rnnstnictinw nf lihrarics 

■ The first library was constructed of the Fn3 domain displayed on the 
surface of Ml 3 phage in which seven residues (77-83) in the FG loop (Fig. 4D) 
: randomized. Randomization will be achieved by the use of an 
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oligonucleotide containing degenerated nucleotide sequence. A double-stranded 
nucleotide was prepared by the same protocol as for gene synthesis (see above) 
except that one strand had an (NNK)6(NNG) sequence at the mutation sites, 
where N corresponds to an equimolar mixture of A, T, G and C and K 
5 corresponds to an equimolar mixture of G and T. The (NNG) codon at residue 
83 was required to conserve the Sad restriction site (Fig. 2). The (NNK) codon 
codes all of the 20 amino acids, while the NNG codon codes 14. Therefore, this 
library contained - 10^ independent sequences. The library was constmcted by 
ligating the double-stranded nucleotide into the wild-type phage vector, pASMl, 

10 and the transfecting E. coli XLl blue (Stratagene) using electroporation. XLl 
blue has the lacF phenotype and thus suppresses the expression of the Fn3-pIII 
fusion protein in the absence of lac inducers. The initial library was propagated 
in this way, to avoid selection against toxic Fn3-pin clones. Phages displaying 
the randomized Fn3-pIII fusion protein were prepared by propagating phages 

1 5 with K9 1 kan as the host. K9 1 kan does not suppress the production of the fusion 
protein, because it does not have lacP. Another library was also generated in 
which the BC loop (residues 26-20) was randomized. 

Selection of displayed Monobodies 

20 Screening of Fn3 phage libraries was performed using the bioparming 

protocol (Smith & Scott, 1993); a ligand is biotinylated and the strong biotin- 
streptavidin interaction was used to immobilize the ligand on a streptavidin- 
coated dish. Experiments were performed at room temperature 22°C). For 
the initial recovery of phages from a library, 10 ixg of a biotinylated ligand were 

25 immobilized on a streptavidin-coated polystyrene dish (35 mm, Falcon 1 008) 
and then a phage solution (containing - 10^^ pfu (plaque-forming unit)) was 
added. After washing the dish with an appropriate buffer (typically TEST, Tris- 
HCl (50 mM, pH 7.5), NaCl (150 mM) and Tween 20 (0.5%)), bound phages 
were eluted by one or combinations of the following conditions: low pH, an 

30 addition of a free ligand, urea (up to 6 M) and, in the case of anti-protein G 
Monobodies, cleaving the protein G-biotin linker by thrombin. Recovered 
phages were amplified using the standard protocol using K9 Ikan as the host 
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(Sambrook et ai, 1989). The selection process were repeated 3-5 times to 
concentrate positive clones. From the second round on, the amount of the ligand 
were graduaUy decreased (to ~ 1 jig) and the biotinylated ligand were mixed 
with a phage solution before transferring a dish (G. P. Smith, personal 

5 communication). After the final round, 10-20 clones were picked, and their 
DNA sequence will be determined. The ligand affinity of the clones were 
measured first by the phage-ELISA method (see below). 

To suppress potential binding of the Fn3 firamework (background 
binding) to a ligand, wild-type Fn3 may be added as a competitor in the buffers. 

10 In addition, unrelated proteins (eg., bovine serum albumin, cytochrome c and 
RNase A) may be used as competitors to select highly specific Monobodies. 



Binding assay 

The binding affinity of Monobodies on phage surface is characterized 
1 5 semi-quantitatively using the phage ELISA technique (Li et ai, 1995). Wells of 
microtiter plates (Nunc) are coated with a Hgand protein (or with streptavidin 
followed by the binding of a biotinylated ligand) and blocked with the Blotto 
solution (Pierce). Purified phages (~ 10'° pfii) originating fi-om single plaques 
(M13)/colonies (fUSE5) are added to each well and incubated overnight at 4°C. 
20 After washing wells with an appropriate buffer (see above), bound phages are 
detected by the standard ELISA protocol using anti-M13 Ab (rabbit, Sigma) and 
anti-rabbit Ig-peroxidase conjugate (Pierce) or using anti-M13 Ab-peroxidase 
conjugate (Pharmacia). Colormetric assays are performed using TMB (3,3',5,5'- 
tetramethylbenzidine, Pierce). The high affinity of protein G to 
25 immunoglobulins present a special problem; Abs cannot be used in detection. 
Therefore, to detect anti-protein G Monobodies, fusion phages are immobilized 
in wells and the binding is then measured using biotinylated protein G followed 
by the detection using streptavidin-peroxidase conjugate. 

30 Production of soluble Monobodies 

After preliminary characterization of mutant Fn3s using phage ELISA, 
mutant genes are subcloned into the expression vector pEWl. Mutant proteins 
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are produced as His*tag fusion proteins and purified, and their conformation, 
stability and ligand affinity are characterized. 

in. Increased Stability of Fn3 Scaffolds 

5 The definition of "higher stability" of a protein is the ability of a protein 

to retain its three-dimensional structure required for function at a higher 
temperature (in the case of thermal denaturation), and in the presence of a higher 
concentration of a denaturing chemical reagent such as guanidine hydrochloride. 
This type of "stabihty" is generally called "conformational stability." It has been 
10 shown that conformational stability is correlated with resistance against 

proteolytic degradation, z.e., breakdown of protein in the body (Kamtekar et al 
1993). 

Improving the conformational stability is a major goal in protein 
engineering. Here, mutations have been developed by the inventor that enhance 

1 5 the stability of the fibronectin type III domain (Fn3). The inventor has 

developed a technology in which FnS is used as a scaffold to engineer artificial 
binding proteins (Koide et al, 1998). It has been showoi that many residues in 
the surface loop regions of Fn3 can be mutated without disrupting the overall 
structure of the Fn3 molecule, and that variants of Fn3 with a novel bmding 

20 function can be engineered using combinatorial library screening (Koide et al, 
1998). The inventor found that, although Fn3 is an excellent scaffold, Fn3 
variants that contain large number of mutations are destablized against chemical 
denaturation, compared to the wild-type Fn3 protein (Koide et al, 1998). Thus, 
as the number of mutated positions are mutated in order to engineer a new 

25 binding function, the stability of such Fn3 variants further decreases, ultimately 
leading to marginally stable proteins. Because artificial binding proteins must 
maintain their three-dimensional structure to be functional, stability limits the 
number of mutations that can be introduced in the scaffold. Thus, modifications 
of the Fn3 scaffold that increase its stability are useful in that they allow one to 

30 introduce more mutations for better function, and that they make it possible to 
use Fn3-based engineered proteins in a wider range of apphcations. 

The inventor found that wild-type Fn3 is more stable at acidic pH than at 
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neutral pH (Koide et al., 1998). The pH dependence of Fn3 stability is 
characterized in Figure 1 8. The pH dependence curve has an apparent transition 
midpoint near pH 4 (Figure 1 8). These results suggest that by identifying and 
removing destablizing interactions in Fn3 one is able to improve the stability of 
5 Fn3 at neutral pH. It should be noted that most applications of engineered Fn3, 
such as diagnostics, therapeutics and catalysts, are expected to be used near 
neutral pH, and thus it is important to improve the stability at neutral pH. 
Studies by other investigators have demonstrated that the optimization of surface 
electrostatic properties can lead to a substantial increase in protein stability (Perl 
10 et al. 2000. Spector et al. 1999, Loladze et al 1999, Grimsley et al. 1999). 

The pH dependence of Fn3 stability suggests that amino acids with pJT^ 
near 4 ai-e involved in the observed transition. The carboxyl groups of aspartic 
acid (Asp) and glutamic acid (Glu) have piC^ in this range (Creighton, T.E. 
1 993). It is well known that if a carboxyl group has unfavorable {i.e. 
1 5 destabiUzing) interactions in a protein, its pZ^ is shifted to a higher value from 
its standard, unperturbed value (Yang and Honig 1992). Thus, the ^K, values of 
all carboxyl groups in Fn3 were determined using nuclear magnetic resonance 
(NMR) spectrosocpy, to identify carboxyl groups with unusual ^^s, as shown 
below. 

20 First, the '^C resonance for the carboxyl carbon of each Asp and Glu 

residue were assigned (Figure 19). Next pH titration of resonances was 
performed for these groups (Figure 20). The pisT^ values for these residues are 
listed in Table 3. 

25 Table 3. values for Asp and Glu residues in Fn3. 

Residue P-^o - — 

E9 5.09 

E38 3.79 

E47 3.94 

30 D3 3.66 

D7 3.54, 5.54* 

D23 3.54,5.25* 
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D67 4.18 

D80 3.40 

The standard deviation in the -pK^ values are less than 0.05 pH units. 

*Data for D7 and D23 were fitted with a transition curve with two jK„ values. 

5 

These results show that Asp 7 and 23, and Glu 9 have up-shifted pKJs with 
respect to their unperturbed pK^'s (approximately 4.0), indicating that these 
residues are involved in unfavorable interactions. In contrast, the other Asp and 
Glu residues have p-ST^'s close to the respective unperturbed values, indicating 
1 0 that the carboxyl groups of these residues do not significantly contribute to the 
stability of Fn3. 

In the three-dimensional structure of Fn3 (Main et aL 1992), Asp 7 and 
23, and Glu 9 form a patch on the surface (Figure 21), with Asp 7 centrally 
located in the patch. This spatial proximity of these negatively charged residues 

1 5 explains why these residues have unfavorable interactions in Fn3. At low pH 
where these residues are protonated and neutral, the unfavorable interactions are 
expected to be mostly relieved. At the same time, the structure suggests that the 
stability of Fn3 at neutral pH could be improved if the electrostatic repulsion 
between tliese three residues is removed. Because Asp 7 is centrally located 

20 among the three residues, it was decided to mutate Asp 7. Two mutants were 
prepared, D7N and D7K (z.e., the aspartic acid at amino acid residue number 7 
was substituted with an asparagine residue or a lysine residue, respectively). The 
former replaces the negative charge with a neutral residue of virtually the same 
size. The latter places a positive charge at residue 7. 

25 The degrees of stability of the mutant proteins were characterized in 

thermal and chemical denaturation measurements. In thermal denaturation 
measurements, denaturation of the Fn3 proteins was monitored using circular 
dichroism spectroscopy at the wavelength of 227 nm. All the proteins 
underwent a cooperative transition (Figure 22). From the transition curves, the 

30 midpoints of the transition (T^) for the wild-type, D7N and D7K were 

determined to be 62, 69 and 70 in 0.02 M sodium phosphate buffer (pH 7,0) 
containing 0.1 M sodium chloride and 6.2 M urea. Thus, the mutations 
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increased the of wild-type Fn3 by 7-8 ^C. 

Chemical denaturation of Fn3 proteins was monitored using fluorescence 
emission from the single Tip residue of Fn3 (Figure 23). The free energies of 
unfolding in the absence of guanidine HCl (AG^) were determined to be 7.4, 8.1 
5 and 8.0 kcal/mol for the wild-type, D7N and DTK, respectively (a larger AG^ 
indicates a higher stabiUty). The two mutants were again found to be more 
stable than the wild-type protein. 

These results show that a point mutation on the surface can significantly 
enhance the stability of Fn3. Because these mutations are on the surface, they 
1 0 minimally alter the structure of Fn3, and they can be easily introduced to other, 
engineered Fn3 proteins. In addition, mutations at Glu 9 and/or Asp 23 also 
enhance the stability of Fn3. Furthermore, mutations at one or more of these 
three residues can be combined. 

Thus, Fn3 is the fourth example of a monomeric inmiunoglobulin-like 
1 5 scaffold that can be used for engineering binding proteins. Successful selection 
of novel binding proteins have also been based on minibody, tendamistat and 
"camelized" inununoglobulin VH domain scaffolds (Martin et al, 1994; Davies 
& Riechmann, 1995; McCoimell & Hoess, 1995). The Fn3 scaffold has 
advantages over these systems. Bianchi et al reported that the stabiHty of a 
20 minibody was 2.5 kcal/mol, significantly lower than that of Ubi4"K. No detailed 
structural characterization of minibodies has been reported to date. Tendamistat 
and the VH domain contain disulfide bonds, and thus preparation of correctly 
folded proteins may be difficult. Davies and Riechmann reported that the yields 
of their camelized VH domains were less than 1 mg per liter culture (Davies & 
25 Riechmann, 1996). 

Thus, the Fn3 framework can be used as a scaffold for molecular 
recognition. Its small size, stability and well-characterized structure make Fn3 
an attractive system. In light of the ubiquitous presence of Fn3 in a wide variety 
of natural proteins involved in ligand binding, one can engineer Fn3-based 
30 binding proteins to different classes of targets. 

The following examples are intended to illustrate but not limit the 
invention. 
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EXAMPLE I 
Construction of the Fn3 gene 

A synthetic gene for tenth Fn3 of fibronectin (Fig.l) was designed on the 
basis of amino acid residue 1416-1509 of human fibronectin (Komblihtt, et al., 
5 1985) and its three dimensional structure (Main, et al, 1992). The gene was 
engineered to include convenient restriction sites for mutagenesis and the so- 
called "preferred codons" for high level protein expression (Gribskov, et al, 
1984) were used. In addition, a glutamine residue was inserted after the N- 
terminal methionine in order to avoid partial processing of the N-terminal 

10 methionine which often degrades NMR spectra (Smith, et al, 1994). Chemical 
reagents were of the analytical grade or better and purchased from Sigma 
Chemical Company and J.T. Baker, unless otherwise noted. Recombinant DNA 
procedures were performed as described in "Molecular Cloning" (Sambrook, et 
al, 1989), unless otherwise stated. Custom oligonucleotides were purchased 

1 5 from Operon Technologies, Restriction and modification enzymes were from 
New England Biolabs. 

The gene was assembled in the following manner. First, the gene 
sequence (Fig. 5) was divided into five parts with boundaries at designed 
restriction sites: fragment 1, Ndel-PstI (ohgonucleotides FNIF and FNIR (Table 

20 2); fragment 2, Pstl-EcoRI (FN2F and FN2R); fragment 3, EcoRI-SaU (FN3F 
and FN3R); fragment 4, Sall-SacI (FN4F and FN4R); fragment 5, SacI-BamHI 
(FN5F and FN5R). Second, for each part, a pair of oligonucleotides which code 
opposite strands and have complementary overlaps of approximately 15 bases 
was synthesized. These oligonucleotides were designated FN1F-FN5R and are 

25 shown in Table 2. Third, each pair {e.g., FNIF and FNIR) was annealed and 
single-strand regions were filled in using the Klenow fragment of DNA 
polymerase. Fourth, the double stranded oligonucleotide was digested with the 
relevant restriction enzymes at the termini of the fragment and cloned into the 
pBlueScript SK plasmid (Stratagene) which had been digested with the same 

30 enzymes as those used for the fragments. The DNA sequence of the inserted 

fragment was confirmed by DNA sequencing using an Applied Biosystems DNA 
sequencer and the dideoxy termination protocol provided by the manufacturer. 
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Last, steps 2-4 were repeated to obtain the entire gene. 

The gene was also cloned into the pET3a and pET15b (Novagen) vectors 
(pAS45 and pAS25, respectively). The maps of the plasmids are shown in Figs. 
6 and 7. E. coli BL21 (DE3) (Novagen) containing these vectors expressed the 
5 Fn3 gene under the control of bacteriophage T7 promotor (Studier, et al, 1990); 
pAS24 expresses the 96-residue Fn3 protein only, while pAS45 expresses Fn3 as 
a fiision protein with poly-histidine peptide (His-tag). High level expression of 
the Fn3 protein and its derivatives in £. coli was detected as an intense band on 

SDS-PAGE stained with CBB. 
10 The binding reaction of the monobodies is characterized quantitatively by 

means of fluorescence spectroscopy using purified soluble monobodies. 

Intrinsic fluorescence is monitored to measure binding reactions. Tip 
fluorescence (excitation at -290 mn, emission at 300 350 mn) and Tyr 
fluorescence (excitation at -260 mn, emission at -303 mn) is monitored as the 
15 Fn3-mutant solution (^ 100 jiM) is titrated with a ligand solution. When a 
ligand is fluorescent {e.g. fluorescein), fluorescence from the ligand may be 
used. K, of the reaction will be determined by the nonlinear least-squares fitting 
of the bimolecular binding equation. 

If intrinsic fluorescence cannot be used to monitor the binding reaction, 
20 monobodies are labeled with fluorescein-NHS (Pierce) and fluorescence 
polarization is used to monitor the binding reaction (Burke et al, 1996). 

EXAMPLE II 
Modifications to include restriction sites in the Fn3 gene 

25 The restriction sites were incorporated in the synthetic Fn3 gene without 

changing tiie amino acid sequence Fn3. The positions of the restriction sites 
were chosen so that the gene construction could be completed without 
synthesizing long (>60 bases) oligonucleotides and so that two loop regions 
could be mutated (including by randomization) by the cassette mutagenesis 

30 method {i.e., swapping a fragment with another synthetic fragment containing 
mutations). In addition, die restriction sites were chosen so that most sites were 
unique in the vector for phage display. Unique restriction sites allow one to 
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recombine monobody clones which have been already selected in order to supply 
a larger sequence space. 

EXAMPLE m 
5 Construction of M13 phage display libraries 

A vector for phage display, pAS38 (for its map, see Fig. 8) was 
constructed as follows. The Xbal-BamHI fragment of pET12a encoding the 
signal peptide of OmpT was cloned at the 5' end of the Fn3 gene. The C- 
terminal region (from the FN5F and FN5R oligonucleotides, see Table 2) of the 

1 0 Fn3 gene was replaced with a new fragment consisting of the FN5F and FN5R* 
oligonucleotides (Table 2) which introduced a Mlul site and a linker sequence 
for making a ftision protein with the pIII protein of bacteriophage Ml 3. A gene 
fragment coding the C-terminal domain of Ml 3 pIII was prepared from the wild- 
type gene III of M13mpl8 using PGR (Corey, aL, 1993) and the fragment was 

1 5 inserted at the 3* end of the OmpT-Fn3 fiision gene using the Mlul and Hindlll 
sites. 

Phages were produced and purified using a helper phage, Ml 3K07, 
according to a standard method (Sambrook, et al , 1 989) except that phage 
particles were purified by a second polyethylene glycol precipitation. Successful 
20 display of Fn3 on fusion phages was confirmed by ELISA (Harlow & Lane, 
1988) using an antibody against fibronectin (Sigma) and a custom anti-FN3 
antibody (Cocalico Biologicals, PA, USA). 

EXAMPLE IV 

25 Libraries containing loop variegations in the AB loop 

A nucleic acid phage display library having variegation in the AB loop is 
prepared by the following methods. Randomization is achieved by the use of 
oligonucleotides containing degenerated nucleotide sequence. Residues to be 
variegated are identified by examining the X-ray and NMR structures of FnS 
30 (Protein Data Bank accession numbers, IFNA and ITTF, respectively). 

Oligonucleotides containing NNK (N and K here denote an equimolar mixture of 
A, T, G, and C and an equimolar mixture of G and T, respectively) for the 
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variegated residues are synthesized (iee oligonucleotides BC3, FG2, FG3, and 
FG4 in Table 2 for example). The NNK mixture codes for all twenty amino 
acids and one termination codon (TAG). TAG, however, is suppressed in the E. 
coli XL-1 blue. Single-stranded DNAs of pAS38 (and its derivatives) are 
5 prepared using a standard protocol (Sambrook, et al, 1989). 

Site-directed mutagenesis is performed following published methods (see 
for example, Kunkel, 1985) using a Muta-Gene kit (BioRad). The libraries are 
constructed by electroporation ofE. coli XL-1 Blue electroporation competent 
cells (200 til; Stratagene) with Ijig of the plasmid DNA using a BTX electrocell 
10 manipulator ECM 395 1mm gap cuvette. A portion of the transformed cells is 
plated on an LB-agar plate containing ampiciUin (100 ^g/ml) to determine the 
transformation efficiency. Typically, 3 X 10? transfomiants are obtained with 1 
Jig of DNA, and thus a library contains 10« to 10^ independent clones. Phagemid 
particles were prepared as described above. 

15 

EXAMPLE V 
Loop variegations in the EC, CD, DE, EF or FG loop 

A nucleic acid phage display library having five variegated residues 
(residues number 26-30) in the BC loop, and one having seven variegated 
20 residues (residue numbers 78-84) in the FG loop, was prepared using the 
methods described in Example IV above. Other nucleic acid phage display 
libraries having variegation in the CD, DE or EF loop can be prepared by similar 
methods. 

25 EXAMPLE VI 

Loop variegations in the FG and BC loop 

A nucleic acid phage display library having seven variegated residues 
(residues number 78-84) in the FG loop and five variegated residues (residue 
number 26-30) in the BC loop was prepared. Variegations in the BC loop were 
30 prepared by site-directed mutagenesis (Kunkel, et al.) using the BC3 
oligonucleotide described in Table 1 . Variegations in the FG loop were 
introduced using site-directed mutagenesis using the BC loop library as the 
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starting material, thereby resulting in libraries containing variegations in both BC 
and FG loops. The oligonucleotide FG2 has variegating residues 78-84 and 
oligonucleotide FG4 has variegating residues 77-81 and a deletion of residues 
82-84. 

5 A nucleic acid phage display library having five variegated residues 

(residues 78-84) in the FG loop and a three residue deletion (residues 82-84) in 
the FG loop, and five variegated residues (residues 26-30) in the BC loop, v^as 
prepared. The shorter FG loop was made in an attempt to reduce the flexibility of 
the FG loop; the loop was shown to be highly flexible in Fn3 by the NMR 

10 studies of Main, et al (1992). A highly flexible loop may be disadvantageous to 
forming a binding site with a high affinity (a large entropy loss is expected upon 
the ligand binding, because the flexible loop should become more rigid). In 
addition, other Fn3 domains (besides human) have shorter FG loops (for 
sequence alignment, see Figure 12 in Dickmson, et al (1994)). 

1 5 Randomization was achieved by the use of oHgonucleotides containing 

degenerate nucleotide sequence (oligonucleotide BC3 for variegating the BC 
loop and oligonucleotides FG2 and FG4 for variegating the FG loops). 

Site-directed mutagenesis was performed following published methods 
(see for example, Kunkel, 1985). The hbraries were constructed by 

20 electrotransfonning E, coli XL- 1 Blue (Stratagene). Typically a library contains 
10^ to 10^ independent clones. Library 2 contains five variegated residues in the 
BC loop and seven variegated residues in the FG loop. Library 4 contains five 
variegated residues in each of the BC and FG loops, and the length of the FG 
loop was shortened by three residues. 

25 

EXAMPLE VII 
fd phage display libraries constructed with loop variegations 

Phage display libraries are constructed using the fd phage as the genetic 
vector. The Fn3 gene is inserted in fUSES (Parmley & Smith, 1988) using Sfil 
30 restriction sites which are introduced at the 5' and 3' ends of the Fn3 gene using 
PGR. The expression of this phage results in the display of fiie fiision pIII 
protein on the surface of the fd phage. Variegations in the Fn3 loops are 



wo 02/04523 



PCT/USO 1/21855 



44 

introduced using site-directed mutagenesis as described hereinabove, or by 
subcloning the Fn3 libraries constructed in Ml 3 phage into the fUSE5 vector. 

EXAMPLE Vra 
5 Other phage display libraries 

T7 phage libraries (Novagen, Madison, WI) and bacterial pili expression 
systems (Invitrogen) are also useful to express the Fn3 gene. 

EXAMPLE IX 

1 0 Isolation of polypeptides which bind to macromolecular strnetures 

The selection of phage-displayed monobodies was performed following 
the protocols of Barbas and coworkers (Rosenblum & Barbas, 1995). Briefly, 
approximately 1 \ig of a target molecule ("antigen") in sodium carbonate buffer 
(100 mM, pH 8.5) was immobilized in the wells of a microtiter plate (Maxisorp, 

1 5 Nunc) by incubating overnight at 4°C in an air tight container. After the removal 
of this solution, the wells were then blocked with a 3% solution of BSA (Sigma, 
Fraction V) in TBS by incubating the plate at 3TC for 1 hour. A phagemid 
library solution (50 ^l) containing approximately 10*^ colony forming units (cfii) 
of phagemid was absorbed in each well at 37°C for 1 hour. The wells were then 

20 washed with an appropriate buffer (typically TBST, 50 mM Tris-HCl (pH 7.5), 
150 mM NaCl, and 0.5% Tween20) three times (once for tiie first round). 
Bound phage were eluted by an acidic solution (typically, 0.1 M glycine-HCl, pH 
2.2; 50 jil) and recovered phage were immediately neutralized with 3 |xl of Tris 
solution. Alternatively, bound phage were eluted by incubating the wells with 

25 50 \i\ of TBS containing the antigen (1-10 \iM). Recovered phage were 

amplified using the standard protocol employing the XLlBlue cells as the host 
(Sambrook, et al). The selection process was repeated 5-6 times to concentrate 
positive clones. After the final round, individual clones were picked and their 
binding affinities and DNA sequences were determined. 

30 The binding affinities of monobodies on the phage surface were 

characterized using the phage ELISA technique (Li, et al, 1995). Wells of 
microtiter plates (Nunc) were coated with an antigen and blocked with BSA. 
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Purified phages (10^- 10*^ cfu) originating firom a single colony were added to 
each well and incubated 2 hours at 37°C. After washing wells with an 
appropriate buffer (see above), bound phage were detected by the standard 
ELISA protocol using anti-M13 antibody (rabbit, Sigma) and anti-rabbit Ig- 
5 peroxidase conjugate (Pierce). Colorimetric assays were performed using 
Turbo-TMB (3,3',5,5'-tetramethylben2idine, Pierce) as a substrate. 

The binding affinities of monobodies on the phage surface were further 
characterized using the competition ELISA method (Djavadi-Ohaniance, et al, 
1 996). In this experiment, phage ELISA is performed in the same manner as 

10 described above, except that the phage solution contains a ligand at varied 
concentrations. The phage solution was incubated a 4°C for one hoxir prior to 
the binding of an immobilized ligand in a microtiter plate well. The affinities of 
phage displayed monobodies are estimated by the decrease in ELISA signal as 
the free ligand concentration is increased. 

15 After preliminary characterization of monobodies displayed on the . 

surface of phage using phage ELISA, genes for positive clones were subcloned 
into the expression vector pAS45. E. coli BL21(DE3) (Novagen) was 
transformed with an expression vector (pAS45 and its derivatives). Cells were 
grown in M9 minimal medium and M9 medium supplemented with 

20 Bactotryptone (Difco) containing ampicillin (200 jig/ml). For isotopic labeling, 
^^N NH4CI and/or ^^C glucose rq)laced unlabeled components. Stable isotopes 
were purchased fi-om Isotec and Cambridge Isotope Labs. 500 ml medimn in a 2 
1 baffle flask was inoculated with 10 ml of overnight culture and agitated at 
approximately 140 rpm at 37°C. IPTG was added at a final concentration of 1 

25 mM to induce protein expression when OD(600 nm) reached approximately 1 .0. 
The cells were harvested by centrifiigation 3 hours after the addition of IPTG and 
kept frozen at -70°C until used. 

Fn3 and monobodies with His»tag were purified as follows. Cells were 
suspended in 5 ml/(g cell) of 50 mM Tris (pH 7.6) containing 1 mM 

30 phenylmethylsulfonyl fluoride. HEL (Sigma, 3X crystallized) was added to a 
final concentration of 0.5 mg/ml. After incubating the solution for 30 min at 
37°C, it was sonicated so as to cause cell breakage three times for 30 seconds on 
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10 



ice Cell debris was removed by centrifogation at 1 5,000 ipm in an Sorval RC- 
2B centrifuge using an SS-34 rotor. Concentrated sodium chloride is added to 
the solution toafinal concentration of0.5M. The solution was then apphed tea 

1 ml HisTrapTM chelating column (Pharmacia) preloaded with nickel chlonde 
(01 M 1 ml) and equilibrated in the Tris buffer (50 mM, pH 8.0) containing 0.5 
Msodium chloride. After washingthe column wi1hthebuffer,theboundprotem 

was eluted with a Tris buffer (50 mM, pH 8.0) containing 0.5 M imidazole. The 
His.tag portion was cleaved off, when required, by treating the fusion protem 
withthrombinusingtheprotocolsuppliedbyNovagen (Madison, WI). Fn3was 

separated from the His-tag peptide and thrombin by a Resources-column 
(Pharmacia) using a linear gradient of sodium chloride (0 - 0.5 M) in sodmm 

acetate buffer (20 mM, pH 5.0). 

Small amounts of soluble monobodies were prepared as follows. XL-1 
Blue cells containing pAS38 derivatives (plasmids coding FuS-pDI fusion 
15 proteins) were grown in LB media at 37°C with vigorous shaking untilOD(600 
nm) reached approximately 1 .0; IPTG was added to the culture to a final 
concentration of 1 mM, and the cells were further grown overnight at 37°C. 
cells were removed from themediumbycentrifiagation, and the supernatant was 

applied to a microtiter well coated with a ligand. Although XL-l Blue cells 
20 containing pAS38 and its derivatives express FN3-pin fusion proteins, soluble 

proteins are also produced due to the cleavage of the linker between the Fn3 and 

pni regions by proteolytic activities ofE. coli (Rosenblum & Barbas, 1995). 

Binding of amonobody to the ligand was examined by Ibe standard ELISA 

protocol using a custom antibody against Fn3 (purchased from Cocalico 
25 Biologicals, Reamstown, PA). Soluble monobodies obtained from the 

periplasmic fraction oiE. coli cells using a standard osmotic shock method were 

also used. 
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EXAMPLE X 
Ubiquitin binding monobody 

Ubiquitin is a small (76 residue) protein involved in the degradation 
pathway in eurkaiyotes. It is a single domain globular protein. Yeast ubiquitin 
5 was purchased from Sigma Chemical Company and was used without further 
purification. 

Libraries 2 and 4, described in Example VI above, were used to select 
ubiquitin-binding monobodies. Ubiquitin (1 ng in 50 |il sodium bicarbonate 
buffer (100 mM, pH 8.5)) was immobilized in the wells of a microtiter plate, 

1 0 followed by blocking with BSA (3% in TBS), Panning was perfomied as 

described above. In the first two rounds, 1 \ig of ubiquitin was immobilized per 
well, and bound phage were elute with an acidic solution. From the third to the 
sixth rounds, 0. 1 \ig of ubiquitin was immobihzed per well and the phage were 
eluted either with an acidic solution or with TBS containing 10 [iM ubiquitin. 

1 5 Binding of selected clones was tested first in the polyclonal mode, L e. , 

before isolating individual clones. Selected clones from all libraries showed 
significant binding to ubiquitin. These results are shown in Figure 9. The 
binding to the immobilized ubiquitin of the clones was inhibited almost 
completely by less than 30 p-M soluble ubiquitin in the competition ELISA 

20 experiments (see Fig. 10). The sequences of the BC and FG loops of ubiquitin- 
binding monobodies is shown in Table 4. 
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Table 4. Sequences of ubiquitin-binding monobodies 











Occurrence (if 




Name 


BC loo0 


FG loop 


■more than one) 




211 


CARRA 
(SEQIDN0:31) 


RWIPLAK 
(SEQIDNO:32) 


2 


5 


212 


CWRRA 
(SEQIDNO:33) 


RWVGLAW 
(SEQ ID NO:34) 






213 


CKHRR 

(SEQ ID NO:35) 


FADLWWR 
(SEQIDNO:36) 






214 


CRRGR 

(SEQ ID NO:37) 


RGFMWLS 
(SEQIDNO:38) 






215 


CNWRR 
(SEQIDNO:39) 


RAYRYRW 
(SEQIDNO:40) 






411 


SRLRR 

(SEQIDN0:41) 


PPWRV 
(SEQIDNO:42) 


9 


10 


422 


ARWTL 

(SEQ ID NO:43) 


RRWWW 
(SEQ ID N0:44) 






424 


GQRTF 

(SEOIDNO:45) 


RRWWA 
(SEOIDNO:46) 





The 41 1 clone, which was the most enriched clone, was characterized 
using phage ELISA. The 41 1 clone showed selective binding and inhibition i 
15 binding in the presence of about 10 nM ubiquitin in solution (Fig. 1 1). 



EXAMPLE XI 
Methods for the immobilization of small molecules 

Target molecules were immobilized in wells of a microtiter plate 
20 (Maxisorp, Nunc) as described hereinbelow, and the wells were blocked with 
BSA. In addition to the use of carrier protein as described below, a conjugate of 
a target molecule in biotin can be made. The biotinylated hgand can then be 
immobilized to a microtiter plate well which has been coated with streptavidin. 
In addition to the use of a carrier protein as described below, one conld 
25 make a conjugate of a target molecule and biotin (Pierce) and immobilize a 
biotinylated ligand to a microtiter plate well which has been coated with 
streptavidin (Smith and Scott, 1993). 

Small molecules may be conjugated with a carrier protein such as bovine 
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serum albumin (BSA, Sigma), and passively adsorbed to the microtiter plate 
well. Alternatively, methods of chemical conjugation can also be used. In 
addition, solid supports other than microtiter plates can readily be employed. 

5 EXAMPLE Xn 

Fluorescein binding monobody 

Fluorescein has been used as a target for the selection of antibodies from 
combinatorial libraries (Barbas, et al 1992). NHS-fluorescein was obtained 
from Pierce and used according to the manufacturer's instructions in preparing 

10 conjugates with BSA (Sigma). Two types of fluorescein-BSA conjugates were 
prepared with approximate molar ratios of 17 (fluorescein) to one (BSA), 

The selection process was repeated 5-6 times to concentrate positive 
clones. In this experiment, the phage library was incubated with a protein 
mixture (BSA, cytochrome C (Sigma, Horse) and RNaseA (Sigma, Bovine), 1 

1 5 mg/ml each) at room temperature for 30 minutes, prior to the addition to hgand 
coated wells. Bound phage were eluted in TBS containing 10 [xM soluble 
fluorescein, instead pf acid elution. After the final round, individual clones were 
picked and their binding affinities (see below) and DNA sequences were 
detennined. 
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Table 5. Clones from Library #2 



10 



15 



20 



25 



30 





BC 


FG 


WT 


AVTVR (SEQIDNO:47) 


RGDSPAS (SEQ IDNO:48) 




pLB24.1 


CNWRR (SEQ ID NO:49) 


RAYRYRW (SEQ ID jNU.dU) 


pLB24.2 


CMWRA (SEQ ID N0:51) 


RWGMLRR (SEQ ID 


pLB24.3 


ARMRE(SEQIDNO:53) 


RWLRGRY (SEQ ID N0.54) 


pLB24.4 


CARRR(SEQIDNO:55) 


RRAGWGW (SEQ ID NO:56) 


pLB24.5 


CNWRR (SEQ ED NO:57) 


RAYRYRW (SEQ ID NO:58) 


pLB24.6 


RWRER (SEQ ID NO:59) 


RHPWTER (SEQ ID NO:60) 


pLB24.7 


CNWRR (SEQ ID N0:61) 


RAYRYRW (SEQ ID NO:62) 


pLB24.8 


ERRVP(SEQIDNO:63) 


RLLLWQR (SEQ ID NO:64) 


pLB24.9 


GRGAG(SEQIDNO:65) 


FGSFERR (SEQ IDNO:66) 


pLB24.11 


CRWTR (SEQ ID NO:67) 


RRWFDGA (SEQ ID NO:68) 


PLB24.12 


CNWRR (SEQ ID NO:69) 


RAYRYRW (SEQ ID NO:70) 




Clones from Library #4 


WT AVTVR (SEQ ID N0:7 1) GRGDS (SEQ ID NO:72) 




pLB25.1 


GQRTF(SEQIDNO:73) 


RRWWA (SEQ ID NO:74) 


pLB25.2 


GQRTF (SEQ ID NO:75) 


RRWWA (SEQ ID NO:76) 


pLB25.3 


GQRTF(SEQIDNO:77) 


RRWWA (SEQ ID NO:78) 


pLB25.4 


LRYRS (SEQIDNO:79) 


GWRWR (SEQ ID NO:80) 


pLB25.5 


GQRTF (SEQ ID N0:81) 


RRWWA (SEQ ID NO:82) 


pLB25.6 


GQRTF (SEQ ID NO:83) 


RRWWA (SEQ ID NO:84) 


pLB25.7 


LRYRS (SEQIDNO:85) 


GWRWR (SEQ ID NO:86) 


pLB25.9 


LRYRS (SEQ ID NO:87) 


GWRWR (SEQ ID NO:88) 


pLB25.11 


GQR'IT(SEQIDNO:89) 


RRWWA (SEQ ID NO:90) 


PLB25.12 


LRYRS (SE0IDN0:91) 


GWRWR (SEO ID NO:92) 



Preliminary characterization of the binding affinities of selected clones 
were performed using phage ELISA and competition phage ELISA (see Fig. i: 
35 (Fluorescein-1) and Fig. 13 (Fluorescein-2)). The four clones tested showed 
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specific binding to the ligand-coated wells, and the binding reactions are 
inhibited by soluble fluorescein (see Fig. 13), 

EXAMPLE Xni 
5 Digoxigenin binding monobody 

Digoxigenin-3-O-methyl-carbonyl-e-aminocapronicacid-NHS 
(Boehringer Mannheim) is used to prepare a digoxigenin-BSA conjugate. The 
coupling reaction is performed foUowing the manufacturers* instructions. The 
digoxigenin-BS A conjugate is immobilized in the wells of a microtiter plate and 
1 0 used for panning. Panning is repeated 5 to 6 times to enrich binding clones. 
Because digoxigenin is sparingly soluble in aqueous solution, bound phages are 
eluted from the well using acidic solution. See Example XIV. 

EXAMPLE XIV 

1 5 TS AC (transition state analog compound) binding monobodies 

Carbonate hydrolyzing monobodies are selected as follows. A transition 
state analog for carbonate hydrolysis, 4-mtrophenyl phosphonate is synthesized 
by an Arbuzov reaction as described previously (Jacobs and Schultz, 1987). The 
phosphonate is then coupled to the carrier protein, BSA, using carbodiinaide, 

20 followed by exhaustive dialysis (Jacobs and Schultz, 1 987). The hapten-BSA 
conjugate is immobilized in the wells of a microtiter plate and monobody 
selection is performed as described above. Catalytic activities of selected 
monobodies are tested using 4-nitrophenyl carbonate as the substrate. 

Other haptens useful to produce catalytic monobodies are summarized in 

25 H. Suzuki (1994) and in N. R. Thomas (1994). 

EXAMPLE Xy 
NMR characterization of Fn3 and comparison of the Fn3 
secreted by yeast with that secreted by coli 
30 Nuclear magnetic resonance (NMR) experiments are performed to 

identity the contact surface between FnAb and a target molecule, e.g., 
monobodies to fluorescein, ubiquitin, RNaseA and soluble derivatives of 
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digoxigenin. The infonnation is then be used to improve the affinity and 

specificity of the monobody . Purified monobody samples are dissolved m an 
appropriatebuffer for NMR spectroscopy using Amicon ultrafiltration ceUwitha 

YM-3membrane. Buffers are made with 90 o/o Hp/10 %D,0 (distilled grade, 
5 Isotec) or with 100 % D,0. Deuterated compounds (e.g. acetate) are used to 
eliminate strong signals firom them. 

NMR experiments are performed on a Varian Unity INOVA 600 
spectrometer equipped with four RF channels and a triple resonance probe with 
pulsed field gradient capability. NMR spectra are analyzed using processmg 
10 programs such as Felix (Molecular Simulations), mnrPipe, PIPP, and CAPP 
(Garrett, et al, 1991; Delagho, et al, 1995) on UNIX workstations. Sequence 
specificresonanceassigmnents aremadeusingwell-established strategy usmga 

set of triple resonance expemnents (CBCA(CO)NH and HNCACB) (Grzesiek & 

Bax 1992; Wittenkind & Mueller, 1993). 
15 ' Nuclear Overhauser effect (NOE) is observed between -H nuclei closer 

than approximately 5 A, which allows one to obtain infonnation on interproton 
distances. A series of double- and triple-resonance experiments (Table 6; for 
recent reviews on these techniques, see Bax & Grzesiek, 1993 and Kay, 1995) 
are performed to collect distance {i.e. NOE) and dihedral angle (J-couplmg) 
.0 constraints. Isotope-filtered experiments are performed to detemiine resonance 
assigmnents of the bound ligand and to obtain distance constraints wtthm the 
Ugand and those between FnAb and the Hgand. Details of sequence specific 
resonance assigmnents and NOE peak assigmnents have been described m detail 
elsewhere (Clore & Gronenbom, 1991; Pascal, et al. 1994b; Metzler, et al, 
25 1996). 

Table 6. NMR experiments for structure characterization 

By ppriinPntName Reference 

30 1 . reference spectra 

2D-'H, -N-HSQC (Boder,hausen & Ruben. 1980; Kay, et al, 1992) 
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2D-'H, '^C-HSQC (Bodenhausen & Ruben, 1980; Vuister & Bax, 1992) 

2. backbone and side chain resonance assignments of '^C/"N-labeled protein 

5 3D-CBCA(C0)NH (Grzesiek &. Bax, 1992) 

3D-HNCACB (Wittenkind & MueUer, 1993) 

3D-C(C0)NH (Logan et al, 1992; Grzesiek et al. , 1993) 
3D-H(CC0)1SIH 

3D-HBHA(CBCACO)NH (Grzesiek & Bax, 1993) 

10 3D-HCCH-T0CSY (Kay a/., 1993) 

3D-HCCH-C0SY (Ikuraei a/., 1991) 

3D-'H, '^N-TOCSY-HSQC (Zhang a/., 1994) 

2D-HB(CBCDCE)HE (Yamazakie/a/., 1993) 

15 3. resonance assignments of unlabeled ligand 

2D-isotope-filtered 'H-TOCSY 
2D-isotope-fihered 'H-COSY 

2D-isotope-filtered 'H-NOESY (Ikura & Bax, 1992) 

20 

4. structural constraints 
within labeled protein 

3D-'H, '^N-NOESY-HSQC (Zhang e/ a/., 1994) 

4D-'H, "C-HMQC-NOESY-HMQC (Vuister et al, 1993) 
25 4D-'H, '^C, "N-HSQC-NOESY-HSQC (Muhandiramer al, 1993; Pascal et al., 1994a) 
within unlabeled ligand 

2D-isotope-filtered 'H-NOESY (Ikura & Bax, 1 992) 
interactions between protein and ligand 
3D-isotope-filtered 'H, "N-NOESY-HSQC 
30 3D-isotope-filtered 'H, '^C-NOESY-HSQC (Lee et al, 1994) 



5. dihedral angle constraints 



J-molulated 'H, '^N-HSQC (Billeter et al, 1 992) 

35 3D-HNHB (Archer e/ a/., 1991) 
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Backbone 'H, '^N and resonance assignments for a monobody are 
compared to those for wild-type Fn3 to assess structural changes in the mutant. 
Once these data establish that the mutant retains the global structure, structural 
5 refinement is performed usmg experimental NOE data. Because the structural 
difference of a monobody is expected to be minor, the wild-type structure can be 
used as the initial model after modifying the amino acid sequence. The 
mutations are introduced to the wild-type structure by interactive molecular 
modeling, and then the structure is energy-minimized using a molecular 
10 modeling program such as Quanta (Molecular Simulations). Solution structure 
is refined using cycles of dynamical simulated annealing (Nilges et al, 1988) in 
the program X-PLOR (Brunger, 1992). Typically, an ensemble of fifty structures 
is calculated. The validity of the refined structures is confirmed by calculating a 
fewer number of structures from randomly generated initial structures in X- 
15 PLOR using the YASAP protocol (Nilges, et al, 1991). Stmcture of a 
monobody-ligand complex is calculated by first refining both components 
individually using intramolecular NOEs, and then docking the two using 

intermolecular NOEs. 

For example, the 'H, '^N-HSQC spectrum for the fluorescein-binding 
20 monobody LB25.5 is shown in Figure 14. The spectrum shows a good 

dispersion (peaks are spread out) indicating that LB25.5 is folded into a globular 
conformation. Further, the spectrum resembles that for the wild-type Fn3, 
showing that the overall stmcture of LB25.5 is similar to that of Fn3. These 
results demonstrate that hgand-binding monobodies can be obtained without 
25 changing the global fold of the Fn3 scaffold. 

Chemical shift perturbation experiments are performed by forming the 
complex between an isotope-labeled FnAb and an unlabeled ligand. The 
formation of a stoichiometric complex is followed by recording the HSQC 
spectrum. Because chemical shift is extremely sensitive to nuclear environment, 
30 formation of a complex usually results in substantial chemical shift changes for 
resonances of amino acid residues in the interface. Isotope-edited NMR 
experiments (2D HSQC and 3D CBCA(CO)NH) are used to identify the 



55 

resonances that are perturbed in the labeled component of the complex; i.e. the 
monobody. Although the possibility of artifacts due to long-range 
conformational changes must always be considered, substantial differences for 
residues clustered on continuous surfaces are most likely to arise from direct 
5 contacts (Chen et al, 1993; Gronenbom & Clore, 1993). 

An alternative method for mapping the interaction surface utilizes amide 
hydrogen exchange (HX) measurements, HX rates for each amide proton are 
measured for ^^N labeled monobody both free and complexed with a ligaind. 
Ligand binding is expected to result in decreased amide HX rates for monobody 

1 0 residues in the interface between the two proteins, thus identifying the binding 
surface. HX rates for monobodies in the complex are measured by allowing HX 
to occur for a variable time following transfer of the complex to D2O; the 
complex is dissociated by lowering pH and the HSQC spectrum is recorded at 
low pH where amide HX is slow. Fn3 is stable and soluble at low pH, satisfying 

15 the prerequisite for the experiments. 

EXAMPLE XVI 

Construction and Analysis of Fn3-Display System Specific for Ubiquitin 

An Fn3-display system was designed and synthesized, ubiquitin-binding 
20 clones were isolated and a major Fn3 mutant in these clones was biophysically 
characterized. 

Gene construction and phage display of Fn3 was performed as in 
Examples I and II above. The Fn3-phage pIII fusion protein was expressed from 
a phagemid-display vector, while the other components of the Ml 3 phage, 

25 including the wild-type pIII, were produced using a helper phage (Bass et al , 

1990). Thus, a phage produced by this system should contain less than one copy 
of Fn3 displayed on the surface. The surface display of Fn3 on the phage was 
detected by ELISA using an anti"Fn3 antibody. Only phages containing the Fn3- 
pIII fusion vector reacted with the antibody. 

30 After confirming the phage surface to display Fn3, a phage display library 

of Fn3 was constructed as in Example III. Random sequences were introduced 
in the BC and FG loops. In the first library, five residues (77-81) were 
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randomized and three residues (82-84) were deleted from the FG loop. The 
deletion was intended to reduce the flexibility and improve the binding affinity 
of the FG loop- Five residues (26-30) were also randomized in the BC loop in 
order to provide a larger contact surface with the target molecule. Thus, the 
resulting library contains five randomized residues in each of the BC and FG 
loops (Table 7). This library contained approximately 10'' independent clones. 



T ihrarv Screening 

Library screening was performed using ubiquitin as the target molecule. 

10 hi each round of panning, Fn3-phages were absorbed to a ubiquitin-coated 
surface, and bound phages were eluted competitively with soluble ubiquitin. ^ 
The recovery ratio improved from 4.3 x 1 0"' in the second round to 4.5 x lO"^ in 
the fifth round, suggesting an enrichment of bmding clones. After five founds of 
panning, the amino acid sequences of individual clones were detemiined (Table 

15 7). 



Table 7. Sequences in the variegated loops of enriched clones 



20 



Name 


BC loop ^ ] 


FGloop Frequency 


Wild 
Type 


GCAGTTACCGTGCGT 
(SEQIDNO:93) 
AlaValThrValArg 
(SEQ ED NO:94) 


GGCCGTGGTGACAGCCCAGCGAGC 

(SEQ IDNO:95) 
GlyArgGlyAspSerProAlaSer 

(SEQIDNO:96) 




Library 


X X X X X 


]SQ^JKNNK>^^ 

X X X X X (deletion) 




clonel 
(Ubi4) 


TCGAGGTTGCGGCGG 
(SEQ ID NO:97) 
SerArgLexiArgArg 
(SEQ ID NO:98) 


CCGCCGTGGAGGGTG 
(SEQIDNO:99) 
ProProTrpArgVal 
(SEQIDNO:100) 


9 


clone2 


GGTCAGCGAACTTTT 

(SEQIDNO:101) 

GlyGlnArgThrPhe 

(SEQ ID NO: 102) 


AGGCGGTGGTGGGCT 
(SEQ ID NO: 103) 
ArgArgTrpTrpAla 

(SEQ ID NO: 104) . 


1 
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clones 


GCGAGGTGGACGCTT 


AGGCGGTGGTGGTGG 


1 




(SEQIDNO:105) 


(SEQIDNO:107) 






AlaArgTrpThrLeu 


ArgArgTrpTrpTrp 






(SEOIDNO:106) 


(SEOIDNO:108) 





''N denotes an equimolar mixture of A, T, G and C; K denotes an equimolar mixture of G and T. 

5 A clone, dubbed Ubi4, dominated the enriched pool of Fn3 variants. Therefore, • 
further investigation was focused on this Ubi4 clone. lJbi4 contains four 
mutations in the BC loop (Arg 30 in the BC loop was conserved) and five 
mutations and three deletions in the FG loop. Thus 13% (12 out of 94) of the 
residues were altered in Ubi4 fi-om the wild-type sequence. 

1 0 Figure 1 5 shows a phage ELISA analysis of Ubi4. The Ubi4 phage binds 

to the target molecule, ubiquitin, with a significant affinity, while a phage 
displaying the wild-type Fn3 domain or a phase with no displayed molecules 
show little detectable bindmg to ubiquitin (Figure 15a). In addition, the Ubi4 
phage showed a somewhat elevated level of background binding to the control 

15 surface lacking the ubiquitin coating. A competition ELISA experiments shows 
the IC50 (concentration of the free hgand which causes 50% inhibition of 
binding) of the binding reaction is approximately 5 [xM (Fig. 15b). BSA, bovine 
ribonuclease A and cytochrome C show little inhibition of the Ubi4-ubiquitin 
binding reaction (Figure 15c), indicating that the binding reaction of Ubi4 to 

20 ubiquitin does result fi*om specific binding. 

Characterization of a Mutant Fn3 Protein 

The expression system yielded 50-100 mg Fn3 protein per liter culture. 
A similar level of protein expression was observed for the Ubi4 clone and other 
25 mutant Fn3 proteins. 

Ubi4-Fn3 was expressed as an independent protein. Though a majority 
of Ubi4 was expressed in E. coli as a soluble protein, its solubility was found to 
be significantly reduced as compared to that of wild-type Fn3. Ubi4 was soluble 
up to -20 [jlM at low pH, with much lower solubility at neutral pH. This 
30 solubility was not high enough for detailed structural characterization using 
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NMR spectroscopy or X-ray crystallography. 

The solubility of the Ubi4 protein was improved by adding a solubility 
tail, GKKGK (SEQ ID NO: 109), as a C-tenninal extension. The gene for Ubi4- 
Fn3 was subcloned into the expression vector pAS45 using PGR. The C- 
5 terminal solubilization tag, GKKGK (SEQ ID NO: 1 09), was incorporated in this 
step. E. coli BL21 (DE3) (Novagen) was transformed with the expression vector 
(pAS45 and its derivatives). Cells were grown in M9 minimal media and M9 
media supplemented with Bactotryptone (Difco) containing ampicillin (200 
Hg/ml). For isotopic labehng, '^N NH4CI replaced unlabeled NH4CI in the 
1 0 media. 500 ml medium in a 2 liter baffle flask was inoculated with 1 0 ml of 
overnight culture and agitated at SVC. IPTG was added at a jfinal concentration 
of 1 mM to initiate protein expression when OD (600 nm) reaches one. The 
cells were harvested by centrifiigation 3 hours after the addition of IPTG and 
kept frozen at -70*C until used. 
1 5 Proteins were purified as follows. Cells were suspended in 5 ml/(g cell) 

of Tris (50 mM, pH 7.6) containing phenyhnethylsulfonyl fluoride (1 mM). Hen 
egg lysozyme (Sigma) was added to a final concaitration of 0.5 mg/ml. After 
incubating the solution for 30 minutes at 37°C, it was sonicated three times for 
30 seconds on ice. Cell debris was removed by centrifiigation. Concentrated 
20 sodium chloride was added to the solution to a final concentration of 0.5 M. The 
solution was apphed to a Hi-Trap chelating column (Pharmacia) preloaded with 
nickel and equilibrated in the Tris buffer containmg sodium chloride (0.5 M). 
After washing the column with the buffer, histag-FnS was eluted with the buffer 
containing 500 mM imidazole. The protein was fijrther purified using a 
25 Resources column (Pharmacia) with a NaCl gradient in a sodium acetate buffer 

(20 mM, pH 4.6). 

With the GKKGK (SEQ ID NO:l 09) tail, the solubility of the Ubi4 
protein was increased to over 1 mM at low pH and up to -50 [xM at neutral pH. 
Therefore, further analyses were performed on Ubi4 with this C-terminal 
30 extension (hereafter referred to as Ubi4-K). It has been reported that the 

solubility of a minibody could be significantly improved by addition of three Lys 
residues at the N- or C-tennini (Bianchi et ai, 1994). In the case of protein Rop, 
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a non-structured C:-tenninal tail is critical in maintaining its solubility (Smith et 
al, 1995). 

Oligomerization states of the lJbi4 protein were determined using a size 
exclusion column. The wild-type Fn3 protein was monomeric at low and neutral 
5 pH's. However, the peak of the Ubi4-K protein was significantly broader than 
that of wild-type Fn3, and eluted after the wild-type protein. This suggests 
interactions between Ubi4-K and the colunrn material, precluding the use of size 
exclusion chromatography to determine the ohgomerization state of Ubi4. NMR 
studies suggest that the protein is monomeric at low pH. 

10 The Ubi4-K protein retained a binding affinity to ubiquitin as judged by 

ELISA (Figure 15d). However, an attempt to determine the dissociation constant 
using a biosensor (Affinity Sensors, Cambridge, U.K.) failed because of high 
background binding of Ubi4-K-Fn3 to the sensor matrix. This matrix mainly 
consists of dextran, consistent with the observation that interactions between 

1 5 lJbi4-K interacts with the cross-linked dextran of the size exclusion colunrn. 

Example XVn 
Stability Measurements of Monobodies 

Guanidine hydrochloride (GuHCl)-induced unfolding and refolding 
20 reactions were followed by measuring tryptophan fluorescence. Experiments 
were performed on a Spectronic AB-2 spectrofluorometer equipped with a 
motor-driven syringe (Hamilton Co.). The cuvette temperature was kept at SO^C. 
The spectrofluorometer and the syringe were controlled by a single computer 
using a home-built interface. This system automatically records a series of 
25 spectra following GuHCl titration. An experiment started with a 1.5 ml buffer 
solution containing 5 protein. An emission spectrum (300-400 mn; 
excitation at 290 nm) was recorded following a delay (3-5 minutes) after each 
injection (50 or 100 p,l) of a buffer solution containing GuHCL These steps were 
repeated until the solution volume reached the full capacity of a cuvette (3,0 ml). 
30 Fluorescence intensities were normalized as ratios to the intensity at aii 

isofluorescent point which was determined in separate experiments. Unfolding 
curves were fitted with a two-state model using a nonlinear least-squares routine 
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(Santoro & Bolen, 1988). No significant differences were observed between 
experiments with delay times (between an injection and the start of spectrum 
acquisition) of 2 minutes and 1 0 minutes, indicating that the mfolding/refolding 
reactions reached close to an equiUbrium at each concentration point within the 

5 delay times used. 

Conformational stability of Ubi4-K was measured using above-described 
GuHCl-induced unfolding method. The measurements were performed under 
two sets of conditions; first at pH 3.3 in the presence of 300 mM sodium 
chloride, where Ubi4-K is highly soluble, and second in TBS, which was used 
10 for library screening. Under both conditions, the unfolding reaction was 
reversible, and we detected no signs of aggregation or irreversible unfolding. 
Figure 16 shows unfolding transitions of Ubi4-K and wild-type Fn3 with the N- 
terminal (his)^ tag and the C-terminal solubility tag. The stability of wild-type 
Fn3 was not significantly affected by the addition of these tags. Parameters 
characterizing the unfolding transitions are listed in Table 8. 



15 



Table 8. Stability parameters for Ubi4 and wUd-type Fn3 as determined by 
GuHCl-induced unfolding 



20 



Protein 


AGo (kcal mol"') 


mo (kcal mol-' M'') 


Ubi4 (pH 7.5) 


4.8 ±0.1 


2.12 ±0.04 


Ubi4 (pH 3.3) 


6.5 ±0.1 


2.07 ± 0.02 


Wild-type (pH 7.5) 


7.2 ± 0.2 


1.60 ±0.04 


Wild-type (dH 3.3) 


11.2±0.1 


2.03 ± 0.02 



25 



AG. is the free energy of unfolding in the absence of denaturant; mo is the 
dependence of the free energy of unfolding on GuHCI concentration. For 
solution conditions, see Figure 4 caption. 

30 Though the introduced mutations in the two loops certainly decreased the 
stability of Ubi4-K relative to wild-type Fn3, the stability of Ubi4 remains 
comparable to that of a "typical" globular protein. It should also be noted that 
the stabilities of the wild-type and Ubi4-K proteins were higher at pH 3.3 than at 
pH 7.5. 

35 The Ubi4 protein had a significantly reduced solubihty as compared to 
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that of wild-type Fn3, but the solubility was improved by the addition of a 
solubility tail. Since the two mutated loops include the only differences between 
the wild-type and Ubi4 proteins, these loops must be the origin of the reduced 
solubility. At this point, it is not clear whether the aggregation of Ubi4-K is 
5 caused by interactions between the loops, or by interactions between the loops 
and the invariable regions of the Fn3 scaffold. 

The Ubi4-K protein retained the global fold of Fn3, showing that this 
scaffold can acconunodate a large number of mutations in the two loops tested. 
Though the stability of the Ubi4-K protein is significantly lower than that of the 

10 wild-type Fn3 protein, the Ubi4 protein still has a conforaiational stability 
comparable to those for small globular proteins. The use of a highly stable 
domain as a scaffold is clearly advantageous for introducing mutations without 
affecting the global fold of the scaffold. In addition, the GuHCl-induced 
unfolding of the Ubi4 protein is almost completely reversible. This allows the 

15 preparation of a correctly folded protein even when a Fn3 mutant is expressed in 
a misfolded form, as in inclusion bodies. The modest stability of lJbi4 in the 
conditions used for library screening indicates that Fn3 variants are folded on the 
phage surface. This suggests that a Fn3 clone is selected by its binding affinity 
in the folded form, not in a denatured form. Dickinson et hi proposed that Val 

20 29 and Arg 30 in the BC loop stabilize Fn3. Val 29 makes contact with the 

hydrophobic core, and Arg 30 forms hydrogen bonds with Gly 52 and Val 75. In 
Ubi4-Fn3, Val 29 is replaced with Arg, while Arg 30 is conserved. The FG loop 
was also mutated in the library. This loop is flexible in the wild-type structure, 
and shows a large variation in length among hximan Fn3 domains (Main et al, 

25 1992). These observations suggest that mutations in the FG loop may have less 
impact on stability. In addition, the N-terminal tail of Fn3 is adjacent to the 
molecular surface formed by the BC and FG loops (Figure 1 and 17) and does 
not form a well-defined structure. Mutations in the N-tenninal tail would not be 
expected to have strong detrimental effects on stability. Thus, residues in the N- 

30 terminal tail may be good sites for introducing additional mutations. 

Example XVIII 
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NMR Spectroscopy of Ubi4-Fn3 
Ubi4-Fn3 was dissolved in [^HJ-Gly HCl buffer (20 mM, pH 33) 
containing NaCl (300 mM) using an Amicon ultrafiltration unit. The final 
protein concentration was 1 mM. NMR expraiments were performed on a 
5 Varian Unity INOVA 600 spectrometer equipped with a triple-resonance probe 
with pulsed field gradient. Theprobetemperature was set at 30°C. HSQC, 
TOCSY-HSQC and NOESY-HSQC spectra were recorded using published, 
procedures (Kay et al, 1992; Zhang et al, 1994). NMR spectra were processed 
and analyzed using the NMRPipe and NMRView software (Johnson & Blevins, 
10 1 994; Delaglio et al. , 1 995) on UNIX workstations. Sequence-specific 

resonance assignments were made using standard procedures (Wiithrich, 1986; 
Clore & Groneribom, 1991). The assignments for wild-type Fn3 (Baron et al, 
1992) were confirmed using a '^N-labeled protein dissolved in sodium acetate 
buffer (50 mM, pH 4.6) at 30'C. 
1 5 The three-dimensional structure of Ubi4-K was characterized using this 

heteronuclear NMR spectroscopy method. A high quality spectrum could be 
collected on a 1 mM solution of '^N-labeled Ubi4 (Figure 17a) at low pH. The 
linewidth of amide peaks of Ubi4-K was similar to that of wild-type Fn3, 
suggesting that Ubi4-K is monomeric under the conditions used. Complete 
20 assignments for backbone 'H and '^N nuclei were achieved using standard 'H, 
'^N double resonance techniques, except for a row of His residues in the N- 
terminal (His)6 tag. There were a few weak peaks m the HSQC spectrum which 
appeared to originate fi-om a minor species containing the N-terminal Met 
residue. Mass spectroscopy analysis showed that a majority of Ubi4-K does not 
25 contain the N-temiinal Met residue. Fig. 1 7 shows differences in 'HN and '^N 
chemical shifts between Ubi4-K and wild-type Fn3. Only small differences are 
observed in the chemical shifts, except for those in and near the mutated EC and 
FG loops. These results clearly indicate that Ubi4-K retains the global fold of 
Fn3, despite the extensive mutations in the two loops. A few residues in the N- 
30 teiminal region, which is close to the two mutated loops, also exhiTiit significant 
chemical differences between the two proteins. An HSQC spectrum was also 
recorded on a 50 jiM sample of Ubi4-K in TBS. The spectrum was similar to 
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that collected at low pH, indicating that the global conformation of Ubi4 is 
maintained between pH 7.5 and 3.3. 

Example XIX: 

5 Stabilization of Fn3 domain by removing unfavorable electrostatic 

interactions on the protein surface 

Inti'oduction 

Increasing the conformational stability of a protein by mutation is a major 

1 0 interest in protein design and biotechnology. The three-dimensional stractures 
of proteins are stabilized by combination of different types of forces. The 
hydrophobic effect, van der Waals interactions and hydrogen bonds are known to 
contribute to stabilize the folded state of proteins (Kauzmann, W. (1959) Adv. 
Prot Cheim 14, 1-63; DUl, K. A. (1990) Biochemistiy 29, 7133-7155; Pace, C. 

1 5 N., Shirley, B. A., McNutt, M. & Gajiwala, K. (1996) Faseb J 10, 75-83). These 
stabilizing forces primarily originate from residues that are well packed in a 
protein, such as those that constitute the hydrophobic core. Because a change in 
the protein core would induce a rearrangement of adjacent moieties, it is difficult 
to improve protein stability by increasing these forces without massive 

20 computation (Malakauskas, S. M. & Mayo, S. L. (199S) Nat Stj-uct Biol 5, 470- 
475). Ion pairs between charged groups are commonly found on the protein 
surface (Creighton, T. E. (1993) Proteins: stJ^icUires and molecular properties. 
Freeman, New York), and an ion pair could be introduced to a protein with small 
stmctural perturbations. However, a number of studies have demonstrated that 

25 the introduction of an attractive electrostatic interaction, such as an ion pair, on 
protein surface has small effects on stability (Dao-pin, S., Sauer, U., Nicholson, 
H. & Matthews, B. W. (1991) Biochemistiy 30, 7142-7153; Sali, D., Bycroft, M. 
& Fersht, A. R. (1991) •/ MoL Biol 220, 779-788). A large desolvation penalty 
and the loss of conformational entropy of amino acid side chains oppose the 

30 favorable electrostatic contribution (Yang, A.-S. & Honig, B. (1992) Cim\ Opin, 
Stmct. Biol 2, 40-45; Hendsch, Z, S. & Tidor, B. (1994) Protein ScL 3, 21 1- 
226). Recent studies demonstrated that repulsive electrostatic interactions on the 
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protein surface, in contrast, may significantly destabilize a protein, and xhat it is 
possible to improve protein stability by optimizing surface electrostatic 
interactions (Loladze, V. V., Ibarra-Molero, B., Sanchez-Ruiz, J. M. & 
Makhatadze, G. I. (1999) Biochemistry 38, 16419-16423; Perl, D., Mueller, U., 
5 Heinemann, U. & Schmid, F. X. (2000) Nat Sti-uct Biol 7, 380-383; Spector, S., 
Wang, M, Caip, S. A., Robblee, J., Hendsch, Z. S, Fairman, R., Tidor, B. & 
Raleigh, D. P. (2000) Biochemistry 39, 872-879; Grimsley, G. R., Shaw, K. L., 
Fee, L. R., Alston, R. W., Huyghues-Despointes, B. M., Thurlkill, R. L., Scholtz, 
J. M. & Pace, C. N. (1999) Frotein Sci 8, 1843-1849). hi the present 
1 0 experiments, the inventor improved protein stability by modifying surface 
electrostatic interactions. 

During the characterization of monobodies it was found that these 
proteins, as well as wild-type FNfiilO, are significantly more stable at low pH 
than at neutral pH (Koide, A., Bailey, C. W., Huang, X. & Koide, S. (1998) J. 
15 Mol. Biol. 284, 1 141-1 151). These observations indicate that changes in the 
ionization state of some moieties in FNfiilO modulate the conformational 
stability of the protem, and suggest that it might be possible to enhance the 
conformational stability of FNfiilO at neutral pH by adjusting electrostatic 
properties of the protein. Improving the conformational stability of FNfiilO will 
20 also have practical importance in the use of FNfiilO as a scaffold m 
biotechnology apphcations. 

Described below are experiments that detailed characterization of the pH 
dependence of FNfiilO stability, identified unfavorable interactions between side 
chain carboxyl groups, and improved the conforaiational stability of FNfiil 0 by 
25 point mutations on the surface. The results demonstrate that the surface 

electrostatic interactions contribute significantiy to protem stability, and that it is 
possible to enhance protein stability by rationally modulating these interactions. 

Experimental Procedures 

30 Protein expression and purification 

The wild-type protein used for the NMR studies contained residues 1-94 
of FNfiilO (residue numbering is according to Figure 2(a) of Koide et al. (Koide, 
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A., Bailey, C. W., Huang, X. & Koide, S. (1998)/. MoL Biol 284, 1141-1151)), 
and additional two residues (Met-Gln) at the N-terminus (these two residues are 
numbered -2 and -1, respectively). The gene cx>ding for the protein was inserted 
in pET3a (Novagen, WI). Eschericha coli BL21 (DE3) transfonned with the 
5 expression vector was grown in the M9 minimal media supplemented with ^^C- 
glucose and ^^N-ammonium chloride (Cambridge Isotopes) as the sole carbon 
and nitrogen sources, respectively. Protein expression was induced as described 
previously (Koide, A., Bailey, C. W., Huang, X, & Koide, S. (1998)/ Mol Biol 
284, 1 141-1 151). After harvesting the cells by centrifiige, the cells were lysed as 

10 described (Koide, A., Bailey, C. W., Huang, X. & Koide, S. (1998)7. Mol Biol 
284, 1 141-1 151), After centrifiigation, supernatant was dialyzed against 10 mM 
sodium acetate buffer (pH 5.0), and the protein solution was appUed to a SP- 
Sepharose FastFlow coliimn (Amersham Pharmacia Biotech), and FN3 was 
eluted with a gradient of sodium chloride. The protein was concentrated using 

1 5 an Amicon concentrator using YM-3 membrane (Millipore). 

The wild-type protein used for the stability measurements contained an 
N-terminal histag (MGSSHHHHHHSSGLVPRGSH) (SEQ ID NO:l 14) and 
residues -2-94 of FNfalO. The gene for FN3 described above was inserted in 
pET15b (Novagen). The protein was expressed and purified as described 

20 (Koide, A., Bailey, C. W., Huang, X. & Koide, S. (1 998) /. Mol Biol 284, 
1141-1151). The wild-type protein used for measurements of the pH 
dependence shown in Figure 22 contained Arg 6 to Thr mutation, which had 
originally been introduced to remove a secondary thrombin cleavage site (Koide, 
A., Bailey, C. W., Huang, X. & Koide, S. (1998)/. Mol Biol 284, 1141-1151). 

25 Because Asp 7, which is adjacent to Arg 6, was found to be critical in the pH 
dependence of FN3 stability as detailed under Results, subsequent studies were 
performed using the wild-type, Arg 6, background. The genes for the D7N and 
D7K mutants were constructed using standard polymerase chain reactions, and 
inserted in pET15b. These proteins were prepared in the same maimer as for the 

30 wild-type protein. '^C, '^N-labeled proteins for ^K^ measurements were prepared 
as described above, and the histag moiety was not removed fi*om these proteins. 



wo 02/04523 



PCT/USOl/21855 



66 



Chemical denaturation measurements 

Proteins were dissolved to a final concentration of 5 jiM in 10 mM 
sodinm citrate buffer at various pH containing 100 mM sodium chloride. 
Guanidine HCl (GuHCl)-induce unfolding experiments were performed as 
5 described previously (Koide, A., Bailey, C. W., Huang, X. & Koide, S. (1998)7. 
Mol Biol. 284, 1 141-1 151 ; Koide, S., Bu, Z., Risal, D., Pham, T.-N., Nakagawa, 
T., Tamura, A. & Engehnan, D. M. (1999) Biochemist,y 38, 4757-4767). 
GuHCl concentration was determined using an Abbe refractometer (Spectronic 
Ir^ments) as described (Pace, C. N. & Sholtz, J. M (1997) in Protevr 
10 strucuire. A practical approach (Creighton, T. E., Ed.) Vol. pp299-321, IRL 
Press, Oxford). Data were analyzed according to the two-state model as 
described (Koide, A., Bailey, C. W., Huang, X. & Koide, S. (1998) Mol. Biol. 
284, 1141-1 151; Santoro, M. M. & Bolen, D. W. (1988) Biochemistry 27, 8063- 
8068.). 
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Thermal denaturation measurements 

Proteins were dissolved to a final concentration of 5 ^lM in 20 mM 
sodium phosphate buffer (pH 7.0) containing 0.1 or 1 M sodium chloride or in 
20 mM glycine HCl buffer (pH 2.4) containing 0.1 or 1 M sodium chloride. 
20 Additionally 6.3 M urea was included in all solutions to ensure reversibility of 
the thermal denaturation reaction, hi the absence of urea it was found that 
denatured FNfiil 0 adheres to quartz surface, and that the thermal denaturation 
reaction was irreversible. Circular dichroism measurements were performed 
using a Model 202 spectrometer equipped with a Peltier temperature controller 
25 (Aviv Instruments). A cuvette with a 0.5-cm pathlength was used. The 

ellipticity at 227 nm was recorded as the sample temperature was raised at a rate 
of approximately 1 °C per minute. Because of decomposition of urea at high 
temperature, the pH of protein solutions tended to shift upward during an 
experiment. The pH of protein solution was measured before and after each 
30 thermal denaturation measurement to ensure that a shift no more than 0.2 pH 
unit occurred in each measurement. At pH 2.4, two sections of a themial 
denaturation curve (30-65 °C and 60-95 °C) were acquired firom separate 
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samples, in order to avoid a large pH shift. The thermal denaturation data were 
fit with the standard two-state model (Pace, C. N. & Sholtz, J. M. (1997) in 
Protein structure. A practical approach (Creighton, T. E., Ed.) Vol. pp299-321, 
IRL Press, Oxford): 

5 

AG(T) = AH„{,l-T/TJ-ACp[(T^ -T) + T\n{T IT^)] 

where AG(T) is the Gibbs free aiergy of unfolding at temperature T, AH^ is the 
enthalpy change upon imfolding at the midpoint of the transition, Tn,, and ACp is 

1 0 the heat capacity change upon unfolding. The value for ACp was fixed at 1 .74 
kcal mol ' K"', according to the approximation of Myers et al. (Myers, J. K., 
Pace, C. N. & Scholtz, J. M. (1995) Protein Sci. 4, 2138-2148). Most of the 
datasets taken in the presence of 1 M NaCl did not have a sufiBcient baseline for 
the unfolded state, and thus it was assumed the slope of the unfolded baseline in 

1 5 the presence of 1 M NaCl to be identical to that determined in the presence of 
0.1 MNaCl. 

spectroscopy 

NMR experiments were performed at 30 °C on an INOVA 600 
20 spectrometer (Varian Instruments). The C(CO)NH experiment (Grzesiek, S., 
Anglister, J. & Bax, A. (1993) J. Magn. Reson. B 101, 1 14-1 19) and the 
CBCACOHA experiment (Kay, L. E. (1993)/. Am. Chem. Soc. 115, 2055-2057) 
were collected on a ["C, '^N]-wild-type FNfiilO sample (1 mM) dissolved in 
50 mM sodium acetate buffer (pH 4.6) containing 5 % (v/v) deuterium oxide, 
25 using a Varian 5 mm triple resonance probe with pulsed field gradient. The 
carboxyl '^C resonances were assigned based on the backbone 'H, '^C and '^N 
resonance assignments of FNfiilO (Baron, M., Main, A. L., DriscoU, P. C, 
Mardon, H. J., Boyd, J. & Campbell, I. D. (1992) Biochemistry 31, 2068-2073). 
pH titration of carboxyl resonances were performed on a 0.3 mM FNfiilO sample 
30 dissolved in 10 mM sodium citrate containing 100 mM sodium chloride and 5 % 
(v/v) deuteriimi oxide. An 8 mm triple-resonance, pulse-field gradient probe 
(Nanolac Corporation) was used for pH titration. Two-dimensional H(C)CO 
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spectra were collected using the CBCACOHA pulse sequence as described 
previously (Mcintosh, L. P., Hand, G., Johnson, P. E., Joshi, M. D., Koemer, M., 
Plesniak, L. A., Ziser, L., Wakarchuk, W. W. & Withers, S. G. (1996) 
Biochemistry 55, 9958-9966). Sample pH was changed by adding small aUquots 

5 of hydrochloric add, and pH was measured before and after taking NMR data. 
'H, '^N-HSQC spectra were taken as described previously (Kay, L. E., Keifer, P. 
& Saarinen, T. (1992)/. Am. Chem. Soc. 114, 10663-10665). NMR data were 
processed using the NMRPipe package (Delaglio, F., Grzesiek, S., Vuister, G. 
W., Zhu, G., Pfeifer, J. & Bax, A. (1995) J. Biomol. MdR 6, 277-293), and 

10 analyzed using the NMRView software (Johnson, B. A. & Blevins, R. A. (1994) 
J. Biomol. NMR 4, 603-614). 

NMR titration curves of the carboxyl '^C resonances were fit to the 
Henderson-Hasselbalch equation to determine ^K^s,: 

S{pH) = {5^., + ^..10^^^-^">) / (1 + 10^^"-^^'"^) 
1 5 where 6 is the measured chemical shift, 6,,^ is the chemical shift associated with 
the protonated state, b^, is the chemical shift associated with the deprotonated 
state, and pi^„ is the value for the residue. Data were also fit to an equation 
with two ionizable groups: 



(I+IO^^^"-^^'"^ +10^^^^"^^'""'^^''"^) 

where 6^2, ^ah and 6^ are the chemical shifts associated with the fiilly 
protonated, singularly protonated and deprotonated states, respectively, and pK^,, 
and pK^ are piiT^'s associated with the two ionization steps. Data fitting was 
25 performed using the nonlinear least-square regression method in the program 
Igor Pro (WaveMetrix, OR) on a Macintosh computer. 



Results 

pH Dependence of FNfnlO stability 
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Previously, it was found that FN&IO is more stable at acidic pH than at 
neutral pH (Koide, A., Bailey, C. W., Huang, X. & Koide, S. (1998) J. Mol Biol 
284, 1141-1151). In the present experiments, the pH dependence of its stability 
was further characterized. Because of its high stability, FNfiil 0 could not be 
5 fully denatured in urea at 30 °C. Thus GuHCl-induced chemical denaturation 
(Figure 1 8) was used. The denaturation reaction was fully reversible under all 
conditions tested. In order to minimize errors caused by extrapolation, the free 
energy of unfolding at 4 M GuHCl was used for comparison (Figure 1 8), The 
stability increased as the pH was lowered, with apparent plateaus at both ends of 

10 the pH range. The pH dependence curve has an apparent transition midpoint 
near pH 4. In addition, a gradual increase in the 77z value, the dependence -of the 
unfolding free energy on denaturant concentration was noted. Pace et al 
reported a similar pH dependence of the m value for bamase (Pace, C. N., 
Laurents, D. V. & Erickson, R. E. (1992) Biochemistiy 31, 2728-2734). These 

1 5 results indicate that FNfiil 0 contains interactions that stabilize the protein at low 
pH, or those that destabilize it at neutral pH. The results also suggest that by 
identifying and altering the interactions that give rise to the pH dependence, one 
may be able to improve the stability of FNfrilO at neutral pH to a degree similar 
to that found at low pH. 

20 

Deteiniination of pKJs of the side chain carboxyl gj-oups in wild-type FNfnlO 
The pH dependence of FNfiil 0 stability suggests that amino acids with 
pK^ near 4 are involved in the observed transition. The carboxyl groups of Asp 
and Glu generally have p^^ in this range (Creighton, T. E. (1993) Proteins: 

25 st}-uctures and molecular propej'ties. Freeman, New York). It is well known that 
if a carboxyl group has unfavorable (i.e. destabilizing) interactions in the folded 
state, its p^^ is shifted to a higher value from its unperturbed value (Yang, A>S. 
8i Honig, B. (1992) Cwrr. Opin, Stnict Biol 2, 40-45). If a carboxyl group has 
favorable interactions in the folded state, it has a lower ^K^. Thus, tlae pZ^ 

30 values of all carboxylates in FNfiilO using heteronuclear NMR spectroscopy 
were determined in order to identify stabilizing and destabilizing interactions 
involving carboxyl groups. 
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First, the "C resonance for the carboxyl carbon of each Asp and Glu 
residue in FN3 was assigned (Figure 1 9). Next, pH titration of the '^C 
resonances for these groups was performed (Figure 20). Titration curves for Asp 
3, 67 and 80, and Glu 38 and 47 could be fit well with the Henderson- 

5 Hasselbalch equation with a single pK^ The pX^ values for these residues (Table 
9) are either close to or slightly lower than their respective unperturbed values 
(3.8-4.1 for Asp, and 4.1-4.6 for Glu (Kuhlman, B., Luisi, D. L., Young, P. & 
Raleigh, D. P. (1999) Biochemistry 38, 4896-4903)), indicating that these 
carboxyl groups are involved in neutral or slightly favorable electrostatic 

1 0 interactions in the folded state. 



Table 9. pK„ values for Asp and Glu residues in FN3'. 

Residue Protein 







Wild-Type 


D7N 


D7K 


15 


E9 


3.84, 5.40^ 


4.98 


4.53 




E38 


3.79 


3.87 


3.86 




E47 


3.94 


3.99 


3.99 




D3 


3.66 


3.72 


3.74 




D7 


3.54, 5.542 






20 


D23 


3.54, 5.25^ 


3.68 


3.82 




D67 


4.18 


4.17 


4.14 




D80 


3.40 


3.49 


3.48 



'The standard deviations in the pK^ values are less than 0.05 pH units for those fit with a single 
pX^ and less than 0.15 pH unit for those with two pfiC^'s. 
25 ^Data for E9, D7 and D23 were fit with a transition curve with two p^^ values. 

The titration curves for Asp 7 and 23, and Glu 9 were fit better with the 
Henderson-Hasselbalch equation with two pK^ values, and one of the two pK^ 
values for each were shifted higher than the respective unperturbed values (Figure 
30 1 9B). The titration curves with two apparent pK^ values of these carboxyl groups 
may be due to influence of an ionizable group in the vicinity. In the three- 
dimensional structure of FNfiilO (Main, A. L., Harvey, T. S., Baron, M., Boyd, J. 
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& Campbell, I. D, (1992) Cell 71, 671-678), Asp 7 and 23, and Glu 9 fonn a 
patch on the surface (Figure 21), with Asp 7 centrally located in the patch. Thus, 
it is reasonable to expect that these residues influence each other*s ionization 
profile. In order to identify which of the three residues have a highly upshifted 
5 pK^, the H(C)CO spectrum of the protein in 99 % D2O buffer at pH* 5.0 (direct 
pH meter reading) was then collected. Asp 23 and Glu 9 showed larger 
deuterium isotope shifts (0.33 and 0.32 ppm, respectively) than Asp 7 (0.1 8 ppm). 
These results show that Asp 23 and Glu 9 are protonated to a greater degree than 
Asp 7, Thus, we concluded that Asp 23 and Glu 9 have highly upshifted pKJs, 
10 due to strong influence of Asp 7. 

Mutational analysis 

The spatial proximity of Asp 7 and 23, and Glu 9 explains the unfavorable 
electrostatic interactions in FNfhlO identified in this study. At low pH where 

1 5 these residues are protonated and neutral, the repulsive interactions are expected 
to be mostly relieved. Thus, it should be possible to improve the stability of 
FNfiilO at neutral pH, by removing the electrostatic repulsion between these three 
residues. Because Asp 7 is centrally located among the three residues, it was 
decided to mutate Asp 7. Two mutants, D7N and D7K were prepared. The 

20 former neutralizes the negative charge with a residue of virtually identical size. 
The latter places a positive charge at residue 7 and increases the size of the side 
chain. 

The ^H, *^N-HSQC spectra of the two mutant proteins were nearly 
identical to that of the wild-type protein, indicating that these mutations did not 

25 cause large structural perturbations (data not shown). The degrees of stability of 
the mutant proteins were then characterized using thermal and chemical 
denaturation measurements. Thermal denaturation measurements were 
performed initially with 100 mM sodium chloride, and 6.3 M urea was included 
to ensure reversible denaturation and to decrease the temperature of the thermal 

30 transition. All the proteins were predominantly folded in 6.3 M urea at room 
temperature. All the proteins underwent a cooperative transition, and the two 
mutants were found to be significantly more stable than the wild type at neutral 
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pH (Figure 22 and Table 10). Furthermore, these mutations almost eliminated the 
pH dependence of the conformational stability of FNfiilO. These results 
confirmed that destabilizmg interactions involving Asp 7 in wild-type FNfiilO at 
neutral pH are the primary cause of the pH dependence. 

Table 10. The midpoint of thermal denaturation (in °Q of wild-type and 
mutant FN3 in the presence of 63 M urea. 



Protein 


pH2.4 


pH7.0 




0.1 MNaCl 


IMNaCl 


0.1 MNaCl 


IMNaCl 


wild type 


72 


82 


62 


70 


D7N 


68 


82 


69 


80 


D7K 


69 


77 


70 


78 



10 



15 



The error in the midpoints for the 0.1 M NaCl data is ± 0.5 "C. Because most of the IM NaCI 
data did not have a sufficient baseline for the denatured state, the error in the midpoints for Aese 
data was estimated to be ±2 °C. 



The effect of increased sodium chloride concentration on the 
conformational stability of the wild type and the two mutant proteins was next 
20 investigated. All proteins were more stable in 1 M sodium chloride than in 0.1 M 
sodium chloride (Figure 22). The mcrease of the sodium chloride concentration 
elevated the T„ of the mutant proteins by approximately 10 °C at both acidic and 
neutral pH (Table 10). Remarkably the wild-type protein was also equally 
stabilized at both pH, although it contains unfavorable interactions among the 
25 carboxyl groups at neutral pH but not at acidic pH. 

Chemical denaturation of FNfiilO proteins was monitored using 
fluorescence emission from the single Trp residue of FNfiilO (Figure 23). The 
free energies of unfolding at pH 6.0 and 4 M GuHCl were determined to be 1.1 (± 
0.3), 1 .7 (± 0.2) and 1 .4 (± 0.1) kcal/mol for the wild type, D7N and D7K, 
30 respectively, indicating that the two mutations also increased the conformational 
stability against chemical denaturation. 
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Determination of the pKJs of the side chain carboxyd groups in the mutant 
proteins 

The ionization properties of carboxyl groups in the two mutant proteins 
was investigated. The 2D H(C)CO spectra of the mutant proteins at the high and 
5 low ends of the pH titration (pH -7 and -1 .5, respectively) were nearly identical 
to the respective spectra of the wild type, except for the loss of the cross peaks for 
Asp 7 (data not shown). This similarity allowed for an unambiguous assignment 
of resonances of the mutants, based on the assignments for wild-type FNfiil 0. 
The pH titration experiments revealed that, except for Glu 9 and Asp 23, the 

10 behaviors of Asp and Glu carboxyl groups are very close to their counterparts in 
the wild-type protein (Figure 24 Panels A, C, D, F and G, and Table 9), indicating 
that the two mutations have marginal effects on the electrostatic environments for 
these carboxylates. ha contrast, the titration curves for E9 and D23 show 
significant changes upon mutation (Figure 24 Panels B and E). The -gK^ of D23 

1 5 was lowered by more than 1 .6 and 1 A pH units in the D7N and D7K mutants, 
respectively. These results clearly show that the repulsive interaction between D7 
and D23 contributes to the increase in pX^ of Asp 23 in the wild-type protein, and 
that it was eliminated by the neutralization of the negative charge at residue 1. 
The pK^ of Glu 9 was reduced by 0.4 pH unit by the D7N mutation, while it was 

20 decreased by 0.8 pH units in the D7K mutant. The greater reduction of Glu 9 p^^ 
by the D7K mutation suggests that there is a favorable interaction between Lys 7 
and Glu 9 in this mutant protein. 

Discussion 

25 The present inventor has identified unfavorable electrostatic interactions 

in FNfiilO, and improved its conformational stability by mutations on the protein 
surface. The results demonstrate that repulsive interactions between like charges 
on protein surface significantly destabilize a protein. The results are also 
consistent with recent reports by other groups (Loladze, V. V., Ibarra-Molero, B., 

30 Sanchez-Ruiz, J. M. 8c Makhatadze, G. I. (1999) Biochemistjy 38, 1 6419- 1 6423; 
Perl, D., Mueller, U., Heinemann, U. & Schmid, F. X. (2000) Nat Stj-uct Biol 7, 
380-383; Spector, S., Wang, M., Carp, S. A., Robblee, J., Hendsch, Z. S., 
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Fainnan, R., Tidor, B. & Raleigh, D. P. (2000) Biochemistiy 39, 872-879; 
Grimsley, G. R., Shaw, K. L., Fee, L. R., Alston, R. W., Huyghues-Despointes, B. 
M., Thurlkill, R. L., Scholtz, J. M. & Pace, C. N. (1999) Protein Sci 8, 1 843- 
1849), in which protein stability was improved by eliminating unfavorable 

5 electrostatic interactions on the surface. In these studies, candidates for mutations 
were identified by electrostatic calculations (Loladze, V. V., Ibarra-Molero, B., 
Sanchez-Ruiz, L M. & Makhatadze, G. L (1999) Biochemistry 55, 16419-16423; 
Spector, S., Wang, M., Carp, S. A., Robblee, J., Hendsch, Z. S., Fairman, R., 
Tidor, B. & Raleigh, D. P. (2000) Biochemistiy 39, 872-879; Grimsley, G. R., 
10 Shaw, K. L., Fee, L. R, Alston, R. W., Huyghues-Despointes, B. M., Thurlkill, R. 
L., Scholtz, J. M. & Pace, C N. (1999) Protein Sci 5, 1843-1849) or by sequence 
comparison of homologous proteins with different stability (Perl, D., Mueller, U., 
Heinemann, U. & Schmid, F. X. (2000) Nat Sti-uctBiol 7, 380-383). The present 
strategy using ^K^ determination using NMR has both advantages and 

1 5 disadvantages over the other strategies. The present method directly identifies 
residues that destabilize a protein. Also it does not depend on the availability of 
the high-resolution structure of the protein of interest. Electrostatic calculations 
may have large errors due to the flexibility of amino acid side chains on the 
surface, and the uncertainty in the dielectric constant on the protein surface and in 

20 the protein interior. For example, in the NMR stmcture of FNfiil 0 (Main, A. L,, 
Harvey, T. S., Baron, M., Boyd, J, & Campbell, L D. (1992) Ce// 77, 671-678), 
the root mean squared deviations among 16 model structures for the atom of 
Glu residues are 1.2-2.4 A, and those for Lys atoms are 1.5-3.1 A. Such 
uncertainties in atom position can potentially cause large differences in 

25 calculation results. On the other hand, the present strategy requires the NMR 
assignments for carboxyl residues, and NMR measurements over a wide pH 
range. Although recent advances in NMR spectroscopy have made it 
straightforward to obtain resonance assignments for a small protein, some 
proteins may not be sufficiently soluble over the desired pH range. In addition, 

30 knowledge of the ^K^ values of ionizable groups in the denatured state is 
necessary for accurately evaluating contributions of individual residues to 
stability (Yang, A.-S. & Honig, B. (1992) Cuit. Opin. Stnct, Biol 2, 40-45). 
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Kuhlman et al (Kiohlman, B,, Luisi, D. L., Yoxing, P. & Raleigh, D. P. (1999) 
Biochemistry 38, 4896-4903) showed that p^^s of carboxylates in the denatured 
state has a considerably large range than those obtained from small model 
compounds. Despite these hmitations, the present method is applicable to many 
5 proteins. 

The inventor showed that the unfavorable interactions involving the 
carboxyl groups of Asp 7, Gin 9 and Asp23 were no longer present if these 
groups are protonated at low pH or if Asp 7 was replaced with Asn or Lys. The 
similarity in the measured stability of the mutants and the wild type at low pH 

1 0 (Table 1 0) suggests that no other factors significantly contribute to the pH 

dependence of FN&IO stability and that the mutations caused minimal structural 
perturbations. The little structural perturbation was expected, since the carboxyl 
groups of these three residues are at least 50 % exposed to the solvent, based on 
the solvent accessible surface area calculation on theNMR structure (Main, A. L., 

15 Harvey, T. S., Baron, M., Boyd, J. & Campbell, I. D, (1992) Cell 71, 671-678). 

The difference in thermal stability of the wild-type protein between acidic 
and neutral pH persisted in 1 M sodium chloride (Table 10). Likewise, the wild- 
type protein exhibited a large pH-dependence in stability in 4 M GuHCl (Figure 
1 8). Furthermore, upon the increase in the sodium chloride concentration from 

20 0. 1 to 1 .0 M, the T^ of the wild-type and mutant proteins all increased by ~1 0 °C, 
which is in the same magnitude as the change in T^ of the wild type by the pH 
shift. These data indicate that the unfavorable interactions identified in this study 
were not effectively shielded in 1 M NaCl or in 4 M GuHCl. Because the effect 
of increased sodium chloride was unifonn, this stabilization effect of sodium 

25 chloride is likely due to the nonspecific salting-out effect (Timasheff, S. N. 
(1992) Cun\ Op, Stimct Biol 2, 35-39). Other groups also reported little 
shielding effect of salts on electrostatic interactions (Perutz, M. F., Gronenbom, 
A. M., Clore, G. M., Fogg, J. H. & Shih, D. T. (1985) J Mol Biol 755, 491-498; 
Hendsch, Z. S., Jonsson, T., Sauer, R. T. & Tidor, B. (1996) Biochemistiy 55, 

30 7621-7625). Electrostatic interactions are often thought to diminish with 

increasing ionic strength, particularly if the site of interaction is highly exposed. 
Accordingly, the present data at neutral pH (Table 10) showing no difference in 
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the salt sensitivity between the wild type and the mutants could be interpreted as 
Asp 7 not being responsible for destabilizing electrostatic interactions. Although 
the reason for this salt insensitivity is not yet clear, the present results provide a 
cautionary note on concluding the presence and absence of electrostatic 
5 interactions solely based on salt concentration dependence. 

The carboxyl triad (Asp 7 and 23, and Glu 9) is highly conserved in " 
FNfhl 0 from nine different organisms that were available in the protein sequence 
databank at National Center for Biotechnology hiformation 
(www.ncbi.nhn.nih.gov). In these FN&IO sequences, Asp 9 is conserved except 
1 0 one case where it is replaced with Asn, and Glu 9 is completely conserved. The 
position 23 is either Asp or Glu, preserving the negative charge. As was 
discovered in this study, the interactions among these residues are destabiUzing. 
Thus, their high conservation, despite their negative effects on stability, suggests 
that these residues have functional importance in the biology of fibronectin. to 
1 5 the structure of a four-FN3 segment of human fibronefctm (Leahy, D. J., Aukhil, I. 
& Erickson, H. P. (1996) Cell 84, 155-164), these residues are not directly 
involved in interactions with adjacent domains. Also these residues are located on 
the opposite face of FN&IO from the integrin-binding RGD sequence in the FG 
loop (Figure 21). Therefore, it is not clear why these destabilizing residues are 
20 ahnost completely conserved in FNfiil 0. In contrast, no other FN3 domains in 
human fibronectin contain this carboxyl triad (for a sequence alignment, see ref 
Main, A. L., Harvey, T. S., Baron, M., Boyd, J. & Campbell, I. D. (1992) Cell 71, 
671-678). The carboxyl triad of FNfialO may be involved in important 
interactions that have not been identified to date. 
25 Clarke et al. (Clarke, J., Hamill, S. J. & Johnson, C. M. (1997) JMol Biol 

270, 771-778) reported that the stability of the third FN3 of human tenascin 
(TNfiiS) increases as pH was decreased from 7 to 5. Although they could not 
perform stability measurements below pH 5 due to protein aggregation, the pH 
dependence of TNfii3 resembles that of FNfiilO shown in Figure 18. TNfii3 does 
30 not contain the carboxylate triad at positions 7, 9 and 23 (Leahy, D. J., 

Hendrickson, W. A., Aukhil, I. & Erickson, H. P. (1992) Science 258, 987-991), 
indicating that the destabilization of TNfii3 at neutral pH is caused by a different 
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mechanism from that for FNfiilO. A visual inspection of the TNfii3 structure 
revealed that it has a large number of carboxyl groups, and that Glu 834 and Asp 
850 (numbering according to ref Leahy, D. J., Hendrickson, W. A., Aukhil, I. & 
Erickson, H. P. (1992) Science 258, 987-991) forms a cross-strand pair. It will be 
5 interesting to exanoine whether altering this pair can increase the stability of 
TNfiiS. 

In conclusion, a strategy has been described to experimentally identify 
unfavorable electrostatic interactions on the protein surface and improve the 
protein stability by relieving such interactions. The present results have 

1 0 demonstrated that forming a repulsive interaction between carboxyl groups 

significantly destabilize a protein. This is in contrast to the small contributions of 
forming a solvent-exposed ion pair. Unfavorable electrostatic interactions on the 
surface seem quite common in natural proteins. Therefore, optimization of the 
surface electrostatic properties provides a generally applicable strategy for 

1 5 increasing protein stability (Loladze, V, V., Ibarra-Molero, B., Sanchez-Ruiz, J. 
M, & Makhatadze, G. 1. (1999) Biochemistiy 38, 16419-16423; Perl, D., Mueller, 
U., Heinemann, U. & Schmid, F. X. (2000) Nat Stj^ct Biol 7, 380-383; Spector, 
S., Wang, M., Carp, S. A., Robblee, J., Hendsch, Z. S., Fairman, R,, Tidor, B. & 
Raleigh, D. P. (2000) Biochemistiy 39, 872-879; Grimsley, G. R., Shaw, K. L., 

20 Fee, L. R., Alston, R. W., Huyghues-Despointes, B. M., Thurlkill, R. L., Scholtz, 
J. M. & Pace, C. N. (1999) Protein Sci 5, 1843-1849). hi addition, repulsive 
interactions between carboxylates can be exploited for destabilizing undesirable, 
alternate conformations in protein design ("negative design"). 

25 EXAMPLE XX 

An extension of the carboxyl-terminus of the monobody scaffold 

The wild-type protein used for stability measurements is described imder 
Example 1 9. The carboxyl-terminus of the monobody scaffold was extended by 
four amino acid residues, namely, amino acid residues (Glu-Ile-Asp-Lys) (SEQ 
30 ID NO: 1 1 9), which are the ones tliat immediately follow FN&l 0 of human 

fibronectin. The extension was introduced into the FNfelO gene using standard 
PGR methods. Stability measurements were performed as described under 
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Example 19. The free energy of unfolding of the extended protein was 7.4 kcal 
mol ' at pH 6.0 and 30 °C, veiy close to that of the wild-type protein (7.7 kcal 
mol-'). These results demonstrate that the C-tenninus of the monobody scaffold 
can be extended without decreasing its stability. . 

5 

The complete disclosure of all patents, patent documents and pubhcations 
cited herein are incorporated by reference as if individually incorporated. The 
foregoing detailed description and examples have been given for clarity of 
understanding only. No unnecessary limitations are to be understood therefrom. 
10 The mvention is not limited to the exact details shown and described for 

variations obvious to one skilled in the art will be included within the invention 
defined by the claims. 
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WHAT IS CLAIMED IS: 

1. A fibronectin type HI (Fn3) molecule, wherein the Fn3 comprises a 
stabilizing mutation as compared to a wild-type Fn3. 

2. The Fn3 of claim 1, wherein the stabilizing mutation comprises at least 
one aspartic acid (Asp) residue that has been deleted or substituted with at 
least one other amino acid residue. 



The Fn3 of claim 2, wherein Asp 7 or Asp 23, or both, have been deleted 
or substituted with at least one other amino acid residue. 

The Fn3 of claim 3, wherein Asp 7 or Asp 23, or both, have been 
substituted with an asparagine (Asn) or lysine (Lys) residue. 

The Fn3 of claim 1, wherein the stabiUzing mutation comprises at least 
one glutamic acid (Glu) residue that has been deleted or substituted with 
at least one other amino acid residue. 

The Fn3 of claim 5, wherein Glu 9 has been deleted or substituted with at 
least one other amino acid residue. 

The Fn3 of claim 6, wherein Glu 9 has been substituted with an 
asparagine (Asn) or lysine (Lys) residue. 

The Fn3 of claim 2, wherein Asp 7, Asp 23, and Glu 9 have been deleted 
or substituted with at least one other amino acid residue. 

A fibronectin type III (Fn3) polypeptide monobody comprising a plurality 
of Fn3 p-strand domain sequences that are linked to a plurality of loop 
region sequences, 

wherein one or more of the monobody loop region sequences vary 
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by deletion, insertion or replacement of at least two amino acids from the 
corresponding loop region sequences in wild-type Fn3; 

wherein the P-strand domains of the monobody have at least a 
50% total amino acid sequence homology to the corresponding amino acid 
sequence of wild-type FnS's P-strand domain sequences; and 

wherein the Fn3 comprises a stabilizing mutation. 

1 0. An isolated nucleic acid molecule encoding the Fn3 molecule of claim 9. 

1 L An expression vector comprising an expression cassette operably linked to 
the nucleic acid molecule of claim 10. 

12. A host cell comprising the vector of claim 1 1 . 

13. The monobody of claim 9, wherein at least one loop region is capable of 
binding to a specific binding partner (SBP) to form a polypeptide: SBP 
complex having a dissociation constant of less than 10'^ moles/liter. 

14. The monobody of claim 9, wherein at least one loop region is capable of 
catalyzing a chemical reaction with a catalyzed rate constant (k^) and an 
uncatalyzed rate constant (k„„^J such that the ratio of k,3t/lq,„,3, is greater 
than 10. 

15. The monobody of claim 9, wherein one or more of the loop regions 
comprise amino acid residues: 

i) from 15 to 16 inclusive in an AB loop; 

ii) from 22 to 30 inclusive in a BC loop; 

iii) from 39 to 45 inclusive in a CD loop; 

iv) from 51 to 55 inclusive in a DE loop; 

v) from 60 to 66 inclusive in an EF loop; and 

vi) from 76 to 87 inclusive in an FG loop. 
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1 6. The monobody of claim 9, wherein the monobody loop region sequences 
vary from the wild-type Fn3 loop region sequences by the deletion or 
r^lacement of at least 2 amino adds. 

17. The monobody of claim 9, wherein the monobody loop region sequences 
vary from the wild-type Fn3 loop region sequences by the insertion of 
from 3 to 25 amino acids. 

1 8. An isolated nucleic acid molecule encoding the polypeptide monobody of 
claim 1. 

19. An expression vector comprising an expression cassette operably linked to 
the nucleic acid molecule of claim 18. 

20. The expression vector of claim 19, wherein the expression vector is an 
Ml 3 phage-based plasmid. 

21 . A host cell comprising the vector of claim 1 9. 

22. A me&od of preparing a fibronectin type ffl (Fn3) polypeptide monobody 
comprising the steps of: 

a) providing a DNA sequence encoding a plurality of Fn3 p-strand 
domain sequences that are linked to a plurality of loop region 
sequences, wherein at least one loop region contains a unique 
restriction enzyme site, and wherein at least one of the plurality of 
Fn3 p-strand domain sequences are more stable at neutral pH than 
wild-type Fn3; 

b) cleaving the DNA sequence at the unique restriction site; 

c) inserting into the restriction site a DNA segment known to encode 
a pqjtide capable of binding to a specific binding partner (SBP) or 
a transition state analog compound (TS AC) so as to yield a DNA 
molecule comprising the insertion and the DNA sequence of (a); 
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and 

d) expressing the DNA molecule so as to yield polypeptide 
monobody. 

23. A method of preparing a fibronectin type IE (Fn3) polypeptide monobody 
comprising the steps of: 

(a) providing a replicatable DNA sequence encoding a plurality of 
Fn3 (J-strand domain sequences that are linked to a plurality of 
loop region sequences, wherein the nucleotide sequence of at least 
one loop region is known, and wherein at least one of the plurality 
of FnS P-strand domain sequences are more stable at neutral pH 
than wild-type Fn3 ; 

(b) preparing polymerase chain reaction (PGR) primers sufBciently 
complementary to the known loop sequence so as to Be 
hybridizable under PGR conditions, wherein at least one of the 
primers contains a modified nucleic acid sequence to be inserted 
into the DNA; 

(c) performing polymerase chain reaction using the DNA sequence of 
(a) and the primers of (b); 

(d) annealing and extending the reaction products of (c) so as to yield 
a DNA product; and 

(e) expressing the polypeptide monobody encoded by the DNA 
product of (d). 

24. A method of preparing a fibronectin type III (Fn3) polypeptide monobody 
comprising the steps of: 

a) providing a replicatable DNA sequence encoding a plurality of 
Fn3 p-strand domain sequences that are linked to a plurality of 
loop region sequences, wherein the nucleotide sequence of at least 
one loop region is known, and wherein at least one of the plurality 
of Fn3 P-strand domain sequences are more stable at neutral pH 
than wild-type FnS ; 
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b) perforaiing site-directed mutagenesis of at least one loop region , 
' so as to create a DNA sequence comprising an insertion mutation; 
and I 

i 

o) expressing the polypeptide monobody encoded by the DNA I 

j 

sequence comprising the insertion mutation. j 

25. A kit for performing the method of any one of claims 22-24, comprising a 
replicatable DNA encoding a plurality of Fn3 P-strand domain sequences 
that are linked to a plurality of loop region sequences, wherein at least one 

of the plurality of Fn3 (J-strand domain sequences are more stable at i 

I 

neutral pH than wild-type Fn3. I 

i 

! 

f 

26. A variegated nucleic acid library encoding Fn3 polypeptide monobodies 
comprising a plurality of nucleic acid species each comprising a plurality 
of loop regions, wherein the species encode a plurality of Fn3 p-strand 
domain sequences that are linked to a plurahty of loop region sequences, 

wherein one or more of the loop region sequences vary by 
deletion, insertion or replacement of at least two amino acids from 
corresponding loop region sequences in wild-type Fn3; 

wherein the p-strand domain sequences of the monobody have at 
least a 50% total amino acid sequence homology to the corresponding 
amino acid sequences of (i-strand domain sequences of the wild-type Fn3; 
and 

wherein the Fn3 is more stable at neutral pH than wild-type Fn 



27. The variegated nucleic acid library of claim 26, wherein one or more of 
the loop regions encodes: 

i) an AB amino acid loop from residue 1 5 to 1 6 inclusive; 

ii) a BC amino acid loop from residue 22 to 30 inclusive; 

iii) a CD amino acid loop from residue 39 to 45 inclusive; 

iv) a DE amino acid loop from residue 51 to 55 inclusive; 

v) an EF amino acid loop from residue 60 to 66 inclusive; and 
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vi) an FG amiao acid loop from residue 76 to 87 inclusive. 

28. The variegated nucleic acid library of claim 26, wherein the loop region 
sequences vary from the wild-type Fn3 loop region sequences by the 
deletion or replacement of at least 2 amino acids. 

29. The variegated nucleic acid library of claim 26, wherein the monobody 
loop region sequences vary from the wild-type Fn3 loop region sequences 
by the insertion of from 3 to 25 amino acids. 

30. Hie variegated nucleic acid library of claim 26, wherein a variegated 
nucleic acid sequence comprising from 6 to 75 nucleic acid bases is 
inserted in any one of the loop regions of the species. 

3 1 . The variegated nucleic acid library of claim 26, wherein the variegated 
sequence is constructed so as to avoid one or more codons selected from 
the group consisting of those codons encoding cysteine or the stop codon. 

32. The variegated nucleic acid library of claim 26, wherein the variegated 
nucleic acid sequence is located in the BC loop. 

33. The variegated nucleic acid library of claim 26, wherein the variegated 
nucleic acid sequence is located in the DE loop. 

34. The variegated nucleic acid library of claim 26, wherein tlie variegated 
nucleic acid sequence is located in the FG loop. 

35. The variegated nucleic acid library of claim 26, wherein the variegated 
nucleic acid sequence is located in the AB loop. 



36. 



The variegated nucleic acid library of claim 26, wherein the variegated 
nucleic acid sequence is located in the CD loop. 
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37. The veiriegated nucleic acid library of claim 26, wherein the variegated 
nucleic acid sequence is located in the EF loop. 

38. A peptide display library derived from the variegated nucleic acid library 
of claim 26. 

39. A peptide display library of claim 38, wherein the peptide is displayed on 
the surface of a bacteriophage or virus. 

40. A peptide display library of claim 39, wherein the bacteriophage is M13 or 
fd. 

41 . A method of identifying the amino acid sequence of a polypeptide 
molecule capable of binding to a specific binding partner (SBP) so as to 
form apolypeptide-.SSP complex wherein the dissociation constant of the 
the polypeptide:SBP complex is less than lO'* molesAiter, comprising the 
steps of: 

a) providing a peptide display library according to claim 39; 

b) contacting the peptide display library of (a) with an immobihzed or 
separable SBP; 

c) separating the peptide:SBP complexes from the free peptides, 

d) causing the repHcation of the separated peptides of (c) so as to 
result m a new peptide display library distinguished from that in 
(a) by having a lowered diversity and by being enriched in 
displayed peptides capable of binding the SBP; 

e) optionally repeating steps (b), (c), and (d) with the new library of 
(d); and 

f) determining the nucleic acid sequence of the region encoding the 
displayed peptide of a species from (d) and deducing the peptide 
sequence capable of binding to the SBP. 

42. A method of preparing a variegated nucleic acid library encoding Fn3 
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polypeptide monobodies having a plurality of nucleic acid species each 
comprising aplwality of loop regions, wherein the species encode a 
plurality of Fn3 p-strand domain sequences that are linked to a plurality of 
loop region sequences, wherein one or more of the loop region sequences 
vary by deletion, insertion or replacement of at least two amino acids from 
corresponding loop region sequences in wild-type Fn3, and wherein the p- 
strand domain sequences of the monobody have at least a 50% total amino 
acid sequence homology to the corresponding amino acid sequences of p- 
strand domain sequences of the wild-type Fn3, and wherein the Fn3 
comprises a stabihzing mutation p-strand domain, comprising the steps of 

a) preparing an Fn3 polypeptide monobody having a predetermined 
sequence; 

b) contacting the polypeptide with a specific binding partner (SBP) so 
as to form apolypeptiderSSP complex wherein the dissociation 
constant of the the polypeptide: SBP complex is less than 1 0"^ 
moles/liter; 

c) determining the binding structure of the polypeptide: SBP complex 
by nuclear magnetic resonance spectroscopy or X-ray 
crystallography; and 

d) preparing the variegated nucleic acid library, wherein the 
variegation is performed at positions in the nucleic acid sequence 
which, from the information provided in (c), result in one or more 
polypeptides with improved binding to the SBP. 

43. A method of identifying the amino acid sequence of a polypeptide 

molecule capable of catalyzing a chemical reaction with a catalyzed rate 
constant, k^t, and an uncatalyzed rate constant, k„„^3(, such that the ratio of 
kc3t^ncat is greater than 10, comprising the steps of: 

a) providing a peptide display library according to claim 39; 

b) contacting the peptide display library of (a) with an immobilized or 
separable transition state analog compound (TSAC) representing 
the approximate molecular transition state of the chemical 



wo 02/04523 



PCT/USOl/21855 



98 

reaction; 

c) separating the peptide:TSAC complexes from the free peptides; 

d) causing the replication of the separated peptides of (c) so as to 
result in a new peptide display library distinguished from that in 
(a) by having a lowered diversity and by being enriched in 
displayed peptides capable of binding the TSAC; 

e) optionally repeating steps (b), (c), and (d) with the new library of 
(d); and 

f) determining the nucleic acid sequence of the region encoding the 
displayed peptide of a species from (d) and hence deducing the 
peptide sequence. 

44. A method of preparing a variegated nucleic acid library encoding Fn3 
polypeptide monobodies having a plurality of nucleic acid species each 
comprising a plurality of loop regions, wherein the species encode a 
plurality of Fn3 p-strand domain sequences that are linked to a plurality of 
loop region sequences, wherein one or more of the loop region sequences 
vary by deletion, insertion or replacement of at least two amino acids from 
corresponding loop region sequences in wild-type Fn3, and wherein the p- 
strand domain sequences of the monobody have at least a 50% total amino 
acid sequence homology to the correspondmg amino acid sequences of p- 
strand domain sequences of the wild-type Fn3, and wherein the Fn3 
comprises a stabilizing mutation p-strand domam, comprising the steps of 

a) preparing an Fn3 polypeptide monobody having a predetermined 
sequence, wherein the polypeptide is capable of catalyzing a 
chemical reaction with a catalyzed rate constant, k^^, and an 
uncatalyzed rate constant, k^^at^ such that the ratio of k^aAuncat is 
greater than 10; 

b) contacting the polypeptide with an immobilized or separable 
transition state analog compound (TSAC) representing the 
approximate molecular transition state of the chemical reaction; 

c) determining the binding structure of the polypeptideiTSAC 
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complex by nuclear magnetic resonance spectroscopy or X-ray 
crystallography; and 
d) preparing the variegated nucleic acid library, wherein the 

variegation is performed at positions in the nucleic acid sequence 
which, from the information provided in (c), result in one or more 
polypeptides with improved binding to or stabilization of the 
TSAC. 

45. An isolated polypeptide identified by the method of claim 41 . 

46. An isolated polypeptide identified by the method of claim 43 . 

47. A kit for identifying the amino acid sequence of a polypeptide molecule 
capable of binding to a specific binding partner (SBP) so as to form a 
poIypeptide:SSP complex wherein the dissociation constant of the the 
polypeptide: SBP complex is less than 10'^ moles/liter, comprising the 
peptide display library of claim 39. 

48. A kit for identifying the amino acid sequence of a polypeptide molecule 
capable of catalyzing a chemical reaction with a catalyzed rate constant, 
k^^^, and an uncatalyzed rate constant, k^^„ such that the ratio ofk^^^fk^^^^^ 
is greater than 10, comprising the peptide display library of claim 39. 

49. A polypeptide derived by using the kit of claim 47. 

50. A polypeptide derived by using the kit of claim 48. 



, His PA6I BLANK (JSfTOi 



l/26 



FIG. 1A na IB 




FIG. 1C 



FIG. ID 



WO 02/04523 

2/26 



EcoRI 

Ndel ^- 31 41 

1 t.TT^^-DrTmm YYRmSzETG GNSPVQEm 

A B 

sad Xhol 

71 



FIG. 2 



3/26 












1 \ 1 










B - 


f 


1 , 


1 







210 220 230 240 250 
wavelength (nm) 



260 



30 40 50 60 70 80 90 
Temperature (**C) 



FIG. 3A 



FIG. 3B 



PCTAJSOl/21855 



WO 02/04523 

4/26 




FIG. 4C 



FIG. 4-D 



5/26 



Kdel 

CATATGCAGGTTTCTGATGTTCCGCGTGACCTGGAAGTTGTTGCTGCGACCCCGACTAGC 
MetGlnVaXSerAfipValProArgAspLeuGluValValAXaAlaThrProThrSer 
-2-11 10 

Bell PvuH PstI BsiWI 



CTGCTGATCAGCTGGGATGCTCCT 3CAGTTACCGTGCGT rATTACCGTATCACGTACGGT 
T.#.uT.etirieSerTrnAspAlAPiro |AlaValThrValArqb yrTyrArgIleThrTyrGly 



20 30 

ECOHX 

GAAACCGGTGGTAACTCCCCGGTTCAGGAATTCACTGTACCTGGTTCCAAGTCTACTGCT 
GluThrGlyGlyAsnSerProValGlnGluPheThrValProGlySerl/ysSerThrAaa 
40 50 

Sail BstnOTI 



ACCATCAGCGGCCTGAAACCGGGTGTCGACTATACCATCACTGTATACGCTGTTACT GGC 
ThrIleSerGlyLeuLysProGlyValAspTyrThrIleThrValTyrAlaValThi|Glir 
60 70 



sad Xhol 



CGTGGTGACAGCCCAGCGAGCjrCCAAGCCAATCTCGATTAACTACCGTACCTAGTAACTC 
ArgGlyAspSerProAlaSer SerLysProIleSerlleAsnTyrArgThr 
[ 80 90 



FIG. 5 



PCT/USOl/21855 

WO «2/(»4523 

6/26 




FIG. 6 



7/26 





RG. 7 



wo (»2/04523 



8/26 



PCT/USOl/21855 




FIG. 8 



9/26 



phage EUSA (ubiquitin) 



0.4- 



OJ- 



0 Ligand (+) 

1 Cootrol 



0.1- 




I 



4 



aone# 



FIG. 9 




wo (»2/04523 



10/26 



PCT/USOl/21855 




RG. 11 




FIG. 12 



12/26 




FIG. 13 



13/26 




r^: • •Vi •^*:■ -■; 



.JO. 

0 :^ 



104.0 



106.0 



- 10S.0 



-110.0 



-112.0 



-1J4.D E 

a. 

116.0 

CO 

118.0 g 

1 

a> 

120.0 

O 



122.0 !i2 



124.0 



-126.0 



-128.0 



-130X) 



9.8 9.6 9.4 9^ 9.0 8.8 8.6 8.4 8.2 8.0 7.8 7.6 7.4 7.2 7.0 6.8 6.6 6.4 6.2 6.0 

Chemical Shflt (ppm) 



FIG. 14 



wo U2/(I4?23 



14/26 




in 



o 



Buipuiq % 



^6B^d IM lO 



CO 

d 



-s — ^ 

d o 



- (-) PUB61.1 mn 



15/26 




wo 02/04523 



16/26 



PCIVlJSOl/21855 




RG. 16 



17/26 



110 



E 

Q. 

CL 

!^ 

Ic 
CO 

J 120 
£ 

0) 

O 



o o 

o 

o 



O ' ^^^^ 



o 

o . 

O 



130 



10 



9 8 7 

■"H Chemical Shift (ppm) 



RG. 17A 



wo «2/()4523 



18/26 



PCT/LSOl/21855 




QQ 
O 



(ujdd) H2V 



19/26 




(ludd) N9V 



wo 02/04523 



FCl/Ub01/21«:>5 




.21/26 




FIG. 19 



wo (*2/()4523 



PCT/USOl/21855 




FC17LS01/21855 



t 

I 



23/26 




FIG. 21 



wo 02/04523 PCT/USOl/21855 

24/26 




Temperature fC) 



FIG. 22 



WU 02/()4?2J FC r/USUl/2185S 

25/26 




FIG. 23 



wo 02/04523 



PCT/USOl/21855 



26/26 



180 



179 



178 



+0 



o 



"1 r 



1 r 



+ 



177 



1801^ 



179 



178 

Q. 
Q- 

^177 

'sz 
CO 

"c5 
o 

E 180 - 

0) 

sz 
O 

O 179 

CO 



A. D3 

-\ \ \ Ht^ 

o * o 



o • 



B. D23 



178 - 



177 

176 
180 

179 

178 

177 



+• 



C. D67 



183 - 
182 
181 

180 

183 

182 

181 

180 
184 
183 
182 
181 
180 



+ 

L ^ + • ° 



— i w+ 

• + 

o 



o 



E. E9 



+ 
3 



+ 



+ 0 

• 

+ o 



F. E38 



+ 

o 



J L 



D. D80 
1 ' 



+ 
ft 



+ 
8 



+ 0 



G. E47 



2 3 4 5 6 
pH 



FIG. 24 



4 5 
pH 



] 



l'^^l/u^lM/zl«^^ 



SEQUENCE LISTING 

<110> Research Corporation Technologies, Inc. 
5 Koide, Shohei* 

<120> ARTIFICIAL ANTIBODY POLYPEPTIDES . 

<130> 109.050WO1 

10 

<150> US 60/217,474 
<151> 2000-07-11 

<160i* 121 

15 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 14 
2 0<212> PRT 
<213> Unknown 

<220> 

<223> Anti-hen egg lysozyme (HEL) antibody. 

25 

<400> 1 

Ala Arg Glu Arg Asp Tyr Arg Leu Asp Tyr Trp Gly Gin Gly 
15 10 

30<210> 2 
<211> 17 
<212> PRT 
<213> Unknovm 

35<220> 

<223> An anti-HEL single VH domain termed VH8 , 
<400> 2 

Ala Arg Gly Ala Val Val Ser Tyr Tyr Ala Met Asp Tyr Trp Glv Gin 
40 1 5 10 15 

Gly 
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<210> 3 
<211> 16 
5<212> PRT 
<213> Homo sapiens 



<400> 3 

' Tyr Ala Val Thr Gly Arg Gly Asp Ser Pro Ala Ser Ser Lys Pro He 

c 10 15 

10 1 5 



<210> 4 
<211> 12 
<212> PRT 
15<213> Artificial Sequence 

<220> 

<223> Mutant Dl-3-1. 
20<400> 4 

Tyr Ala Glu Arg Asp Tyr Arg Leu Asp Tyr Pro He 
1 



5 10 



<210> 5 
25<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

30<223> Mutant Dl.3-2. 



<400> 5 

Tyr Ala Val Arg Asp Tyr Arg Leu Asp Tyr Pro He 



1 
35 

<210> 6 
<211> 16 
<212> PRT 

<213> Artificial Sequence 

40 



5 10 



J/ci:5Ui/zi«:>:^ 



3 

<220> 

<223> Mutant Dl.3-3. 
<400> 6 

5Tyr Ala Val Arg Asp Tyr Arg Leu Asp Tyr Ala Ser Ser Lys Pro lie 
15 10 15 

<210> 7 
<211> 13 
10<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Mutant Dl.3-4. 
15 ' 
<400> 7 

Tyr Ala Val Arg Asp Tyr Arg Leu Asp Tyr Lys Pro lie 
1 5 10 

20<210> 8 

<211> 11 ' 
<212> PRT 

<213> Artificial Sequence 

25<220> 

<223> Mutant Dl.3-5- 

<400> 8 

Tyr Ala Val Arg Asp Tyr Arg Ser Lys Pro lie 
30 1 5 10 

<210> 9 
<211> 14 
<212> PRT 
35<213> Artificial Sequence 

<220> 

<223> Mutant Dl.3-6. 
40<400> 9 
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Tyr Ala Val Thr Arg Asp Tyr Arg Leu Ser Ser Lys Pro lie 

c 10 

1 ^ 



<210> 10 
5<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

10<223> Mutant Dl.3-7. 
<400> 10 



Tyr Ala Val Thr Glu Arg Asp Tyr Arg Leu Ser Ser Lys Pro lie 



1 
15 

<210> 11 
<211> 15 
<212> PRT 

<213> Artificial Sequence 

20 

<220> 

<223> Mutant VH8-1. 



5 10 15 



ssi^rill'val Ala Val Val Ser Tyr Tyr Ala Met Asp Tyr Pro He 

10 1^ 



1 5 



<210> 12 
<211> 16 
30<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Mutant VH8-2. 

35 

<400> 12 



^ Ila val Thr Ala Val Val Ser Tr^r Tyr Ala Ser Ser Lys Pro He 

c 10 1^ 

1 ^ 



40<210> 13 
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5 

<211> 59 
<212> DNA 

<213> Artificial Sequence 
5<220> 

<223> Oligonucleotide FNIF. 
<400> 13 

cgggatccca tatgcaggtt tctgatgttc cgcgtgacct ggaagttgtt gctgcgacc 59 

10 

<210> 14 

<211> 55 . 
<212> DNA 

<213> Artificial Sequence 
15 i 
<220> 

<223> Oligonucleotide FNIR. 

f 

<400> 14 

20taactgcagg agcatcccag ctgatcagca ggctagtcgg ggtcgcagca acaac 55 

<210> 15 
<211> 51 
<212> DNA 
25<213> Artificial Sequence 

<220> 

<223> Oligonucleotide FN2F. 
30<400> 15 

ctcctgcagt taccgtgcgt tattaccgta tcacgtacgg tgaaaccggt g 51 

<210> XS 
<211> 39 
35<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide FN2R. 

40 
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6 

<400> 16 

gtgaattcct gaaccgggga gttaccaccg gtttcaccg 

<210> 17 
5<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

10<223> Oligonucleotide FN3F. 
<400> 17 

aggaattcac tgtacctggt tccaagtcta ctgctaccat cagcgg 

15<210> 18 
<211> 38 
<212> DNA 

<213> Artificial Sequence 

20<220> 

<223> Oligonucleotide FN3R. 

<400> 18 

gtatagtcga cacccggttt caggccgctg atggtagc 

25 

<210> 19 
<211> 32 
<212> DNA 

<213> Artificial Sequence 

30 

<220> 

<223> Oligonucleotide FN4F. 
<400> 19 

35cgggtgtcga ctataccatc actgtatacg ct 

<210> 20 
<211> 55 
<212> DNA 
40<213> Artificial Sequence 
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<220> 

<223> Oligonucleotide FN4R. 
<400> 20 

Scgggatccga gctcgctggg ctgtcaccac ggccagtaac agcgtataca gtgat 55 

<210> 21 
<211> 35 
<212> DNA 
10<213> Artificial Sequence 

<220> 

<223> Oligonucleotide FN5F. 
15<400> 21 

cagcgagctc caagccaatc tcgattaact accgt 35 

<210> 22 
<211> 37 
2 0<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide FN5R. 

25 

<400> 22 

cgggatcctc gagttactag gtacggtagt taatcga 37 

<210> 23 
30<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

35<223> Oligonucleotide FN5R' - 
<400> 23 . * 

cgggatc^ac gcgtgccacc ggtacggtag ttaatcga 38 



40<210> 24 
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<211> 44 
<212> DWA 

<213> Artificial Sequence 
5<220> 

<223> Oligonucleotide gene3F. 
<400> 24 

cgggatccac gcgtccattc gtttgtgaat atcaaggcca atcg 

10 

<210> 25 
<211> 39 
<212> DNA 

<213> Artificial Sequence 

15 

<220> 

<223> Oligonucleotide gene3R. 
<400> 25 

2 0ccggaagctt taagactcct tattacgcag tatgttagc 

<210> 26 
<211> 36 
<212> DNA 
25<213> Artificial Sequence 

<220> 

<223> Oligonucleotide 3 8TAABglII. 

30<400> 26 

ctgttactgg ccgtgagatc taaccagcga gctcca 

<210> 27 
<211> 51 
35<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide BC3 . 
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<221> misc^f eat\ire 
<222> {!)... (51) 
<223> n = A,T,C or G 

5<400> 27 

gatcagctgg gatgctcctn nknnJamknn lomktattac cgtatcacgt a 

<210> 28 
<211> 57 
10<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide FG2. 

15 

<221> inisc^f eature 
<222> (1) . . . (57) 
<223> n = A,T,C or G 

20<400> 28 

tgtatacgct gttactggcn nknnknnkrm lamknnknnk tccaagccaa tctcgat 

<210> 29 
<211> 47 
25<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide FG3 . 

30 

<221> misc_feature 
<222> (1) . . . (47) 
<223> n = A,T,C or G 

35<400> 29 

ctgtatacgc tgttactggc nnknnknnkn nkccagcgag ctccaag 

<210> 30 
<211> 51 
40<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Oligonucleotide FG4. 

5 

<221> misc_f eature 
<222> (1)..-(51) 
<223> n = A,T,C or G 



10<400> 30 

catcactgta tacgctgtta ctxmlamlam kxmknnlctcc aagccaatct 



<210> 31 
<211> 5 
15<212> PRT 

<213> Artificial Sequence 

<220> , . J. 

^^^= EC loop of ubiquitin-bmdmg 
<223> The sequence or tne xu^jj ^ 

20 monobody clone 211. 

<400> 31 

Cys Ala Arg Arg Ala 
1 5 

25 

<210> 32 
<211> 7 
<212> PRT 

<213> Artificial Sequence 

30 

<220> ^. ^. 

<223> The sequence of the FG loop of ubiquitin-b.nd.ng 

monobody clone 211. 



35<400> 32 

Arg Trp He Pro Leu Ala Lys 



<210> 33 
40<211> 5 
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<212> PRT 

<213"> Artificial Sequence 



<220> 

5<223> The sequence of the BC loop of ubi qui tin -binding 
monobody clone 212. 

*<400> 33 

Cys Trp Arg Arg Ala 
10 1 5 

<210> 34 
<211> 7 
<212> PRT 
15<213> Artificial Sequence 



<220> 

<223> The sequence of the FG loop of ubiqui tin-binding 
monobody clone 212. 

20 

<400> 34 

Arg Trp Val Gly Leu Ala Trp 
• 1 5 

25<210> 35 
<211> 5 
<212> PRT 

<213> Artificial Sequence 



30<220> 

<223> The sequence of the BC loop of ubiqui tin -binding 
monobody clone 213, 



<400> 35 
35Cys Lys His Arg Arg 
1 5 



<210> 36 
<211> 7 
40<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> The sequence of the FG loop of uhi qui tin-binding 
5 monobody clone 213. 

<400> 36 

Phe Ala Asp Leu Trp Trp Arg 
1 5 

10 

<210> 37 
<211> 5 
<212> PRT 

<213> Artificial Sequence 

15 

<220> 

<223> The sequence of the BC loop of ubi qui tin -binding 
monobody clone 214. 

20<400> 37 

Cys Arg Arg Gly Arg 
1 s 

<210> 38 
25<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

30<223> The sequence of the FG loop of ubiqui tin -binding 
monobody clone 214. 

<400> 38 

Arg Gly Phe Met Trp Leu Ser 
35 1 5 

<210> 39 
<211> 5 
<212> PRT 
40<213> Artificial Sequence 
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<220> 

<223> The sequence of the BC loop of ubi quit in -binding 
monobody clone 215. 

5<400> 39 
Cys Asn Trp Arg Arg 
15* 

<210> 40 
10<211> 7 
<212> PRT 

<213> Artificial Sequence 
<22 0> 

15<223> The sequence of the FG loop of ubi qui tin -binding 
monobody clone 215. 

<400> 40 

Arg Ala Tyr Arg Tyr Arg Trp 
20 1 5 

<210> 41 ^ 
<211> 5 
<212> PRT 
25<213> Artificial Sequence 

<220> 

<223> The sequence of the BC loop of ubi qui tin -binding 
monobody clone 411. 

30 

<400> 41 

Ser Arg Leu Arg Arg 
1 5 

35<210> 42 
<211> 5 
<212> PRT 

<213> Artificial Sequence 



40<220> 
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<223> The sequence of the FG loop or h 
monobody clone 411. 



<400> 42 
5Pro Pro Trp Arg Val 



<210> 43 
<211> 5 
10<212> PRT 

<213> Artificial Sequence 

i220> 



^ of 1-he BC loop of ubiquitin-binding 
<223> The sequence of tne ±5^^ j-^^f 

15 monobody clone 422 . 



<400> 43 

Ala Arg Trp Thr Leu 
1 5 

20 

<210> 44 
<211> 5 
<212> PRT 

<213> Artificial Sequence 

25 

r.^ ^-F t-he FG loop of ubiquitm-bmding 
<223> The sequence of the -i-uup 

monobody clone 422. 

30<400> 44 

Arg Arg Trp Trp Trp 
1 5 

<210> 45 
35<211> 5 
<212> PRT 

<213> Artificial Sequence 



""^^^^ . r.^ of the BC loop of ubiquitin-binding 

40<223> The sequence of the -toup 
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monobody clone 424 . 



<400> 45 

Gly Gin Arg Thr Phe 
5 1 5 



<210> 46 
<211> 5 
<212> PRT 
10<213> Artificial Sequence 



<220> 

<223> The sec[uence of the FG loop of ubi qui tin -binding 
monobody clone 424. 

15 

<400> 46 

Arg Arg Trp Trp Ala 
1 5 

20<210> 47 
<211> 5 
<212> PRT 
<213> Unknown 

25<220> 

<223> The sequence of the BC loop of WT from library #2, 
<400> 47 

Ala Val Thr Val Arg 
30 1 5 



<210> 48 
<211> 7 
<212> PRT 
35<213> Unknown 



<220> 

<223> The sequence of the FG loop of WT from library #2 - 



40<400> 48 
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Arg Gly Asp Ser Pro Ala Ser 
1 5 

<210> 49 
5<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

10<223> The sequence of the BC loop of clone pLB24.1, 
<400> 49 

Cys Asn Trp Arg Arg 
1 5 

15 

<210> 50 
<211> 7 
<212> PRT 

<213> Artificial Sequence 

20 

<220> 

<223> The sequence of the FG loop of clone pI,B24,l. 

<400> 50 
2 5 Arg Ala Tyr Arg Tyr Arg Trp 
1 5 

<210> 51 
<211> 5 
30<212> PRT 

<213> Artificial Sequence 

<220> 

<223> The sequence of the BC loop of clone pLB24.2. 

35 

<400> 51 

Cys Met Trp Arg Ala 
1 5 



40<210> 52 
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<211> 7 
<212> PRT 

<213> Artificial Sequence 
5<220> 

<223> The sequence of the FG loop of clone pLB24 
<400> 52 

Arg Trp Gly Met Leu Arg Arg 
10 1 5 

<210> 53 
<211> 5 
<212> PRT 
15<213> Artificial Sequence 

' <220> 

<223> The sequence of the BC loop of clone pLB24 

20<400> 53 

Ala Arg Met Arg Glu 
1 5 

<210> 54 
25<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

30<223> The sequence of the FG loop of clone pLB24 
<400> 54 

Arg Trp Leu Arg Gly Arg Tyr 
1 5 

35 

<210> 55 
<211> 5 
<212> PRT 

<213> QArtificial Sequence 
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<220> 

<223> The sequence of the BC loop of clone pLB24.4. 

<400> 55 
5Cys Ala Arg Arg Arg 
1 5 

<210> 56 
- <211> 7 
10<212> PRT 

<213> Artificial Sequence 

<220> 

<223> The sequence of the FG loop of clone pLB24,4. 

15 

<400> 56 

Arg Arg Ala Gly Trp Gly Trp 

1. 5 

. 20<210> 57 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
25<220> 

<223> The sequence of the BC loop of clone pLB24.5 
<4O0> 57 

Cys Asn Trp Arg Arg 
30 1 5 

<210> 58 
<211> 7 
<212> PRT 
35<213> Artificial Sequence 

<220> 

<223> The sequence of the FG loop of clone pLB24 , 



40<400> 58 
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. Arg Ala Tyr Arg Tyr Arg Trp 
1 5 

<210> 59 
5<211> 5 
<212> PRT 

<213> Artificial Sequence 
"<220> 

10<223> The secjuence of the BC loop of clone pLB24.6. 
<400> 59 

Arg Trp Arg Glu Arg 
1 5 

15 

<210> 60 
<211> 7 
<212> PRT 

<213> Artificial Sequence 

20 

<220> 

<223> The seqiience of the FG loop of clone pLB24.6. 

<400> 60 
2 5 Arg His Pro Trp Thr Glu Arg 
1 5 

<210> 61 

<211> 5 

30<212> PRT 

<213> Artificial Sequence 

<220> 

<223> The sequence of the BC loop of clone pLB24.7. 

35 

<400> 61 

Cys Asn Trp Arg Arg 
1 5 

40<210> 62 



wo 02/04523 



20 

<211> 7 
<212> PRT 

<213> Artificial Sequence 
5<220> 

<223> The sequence of the FG loop of clone pLB24.7. 
<400> 62 

Arg Ala Tyr Arg Tyr Arg Trp 
10 1 ^ 

<210> 63 
<211> 5 
<212> PRT 
15<213> Artificial Sequence 

<220> 

<223> The sequence of the BC loop of clone pLB24.8 

20<400> 63 

Glu Arg Arg Val Pro 
1 5 

<210> 64 
25<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

30<223> The sequence of the FG loop of clone pLB24. 
<400> 64 

Arg Leu Leu Leu Trp Gin Arg 
1 5 

35 

<210> 65 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> The sequence of the BC loop of clone pLB24.9. 

<400> 65 
5Gly Arg Gly Ala Gly 
1 5 

i 

<210> 66 
<211> 7 
10<212> PRT 

<213> Artificial , Sequence 



<220> 

<223> The sequence of the FG loop of clone pLB24.9. 

15 

<400> 66 

Phe Gly Ser. Phe Glu Arg Arg 
1 5 

20<210> 67 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
25<220> 

<223> The sequence of the BC loop of clone pLB24.11, 



<400> 67 

Cys Arg Trp Thr Arg 
30 1 5 



<210> 68 
<211> 7 
<212> PRT 
35<213> Artificial Sequence 



<220> 

<223> The sequence of the FG loop of clone pLB2'l.ll. 



40<400> 68 
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Arg Arg Trp Phe Asp Gly Ala 
1 - 5 

<210> 69 
5<211> 5 
<212> PRT 

<213> Artificial Sequence 



<220> 

10<223> The sequence of the BC loop of clone pLB24.12. 
<400> 69 

Cys Asn Trp Arg Arg 
1 5 

15 

<210> 70 
<211> 7 
<212> PRT 

<213> Artificial Sequence 

20 

<220> 

<223> The sequence of the FG loop of clone pLB24.12 



<400> 70 
25Arg Ala Tyr Arg Tyr Arg Trp 
1 5 



<210> 71 
<211> 5 
30<212> PRT 
<213> Unknown 



<220> 

<223> The sequence of the 
<400> 71 

Ala Val Thr Val Arg 
1 5 



BC loop of WT from library #4. 



40<210> 72 



l'CJ/u^uJ/2l«^^ 



23 

<211> 5 
<212> PRT 
<213> Unknown 

5<220> 

<223> The sequence of the FG loop of WT from library #4. 
<400> 72 

Gly Arg Gly Asp Ser ^-^ 
10 1 5 

<210> 73 
<211> .5 
<212> PRT 
15<213> Artificial Sequence 

<220> 

<223> The sequence of the BC loop of clone pLB25.1- 

20<400> 73 

Gly Gin Arg Thr Phe 
1 5 

<210> 74 
25<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

30<223> The sequence of the FG loop of clone pLB25.1. 
<400> 74 

Arg Arg Trp Trp Ala 
1 5 

35 

<210> 75 
<211> 5 
<212> PRT 

<213> Artificial Sequence 

40 



wo 02/04523 
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<220> 

<223> The sequence of the BC loop of clone pLB25.2. 



<400> 75 
5Gly Gin Arg Thr Phe 
1 5 

<210> 76 
<211> 5 
10<212> PRT 

<213> Artificial Sequence 

<220> 

<223> The sequence of the FG loop of clone pLB25.2 

15 

<400> 76 

Arg Arg Trp Trp Ala 
1 5 

20<210> 77 
<211> 5 
<212> PRT 

<213> Artificial Sequence 



25<220> 

<223> The sequence of the BC loop of clone pIiB25. 



<400> 77 

Gly Gin Arg Thr Phe 
30 1 5 

<210> 78 
<211> 5 
<212> PRT 
35<213> Artificial Sequence 

<220> 

<223> The sequence of che FG loop of clone pLB25 



40<400> 78 
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Arg Arg Trp Trp Ala 
1 5 

<210> 79 
5<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

10<223> The sequence of the BC loop of clone pIjB25.4. 
<400> 79 

Leu Arg Tyr Arg Ser 
1 5 

15 

<210> 80 
<211> 5 
<212> PRT 

<213> Artificial Sequence 

20 

<220> 

<223> The sequence of the FG loop of clone pIiB25.4. 

<400> 80 
25Gly Trp Arg Trp Arg 
1 5 

<210> 81 
<211> 5 
30<212> PRT 

<213> Artificial Sequence 

<220> 

<223> The sequence of the BC loop of clone pLB25.5. 

35 

<400> 81 

Gly Gin Arg Thr Phe 
1 5 

40<210> 82 



wo 02/04523 
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<211> 5 
<212> PRT 

<213> Artificial Sequence 



5<220> 

<223> The sequence of the FG loop of clone pLB25,5, 



<400> 82 

Arg Arg Trp Trp Ala 
10 1 ^ 



<210> 83 
<211> 5 
<212> PRT 
15<213> Artificial Sequence 



<220> 

<223> The sequence of the BC loop of clone pLB25.6 



20<400> 83 

Gly Gin Arg Thr Phe 
1 5 

<210> 84 
25<211> 5 
<212> PRT 

<213> Artificial Sequence • 
<220> 

3 0<223> The sequence of the FG loop of clone pLB25. 
<400> 84 

Arg Arg Trp Trp Ala 
1 5 

35 

<210> 85 
<211> 5 
<2X2> PRT 

<213> Artificial Sequence 
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<220> 

<223> The sequence of the BC loop of clone pLB25.7. 

<400> 85 
5Leu Arg Tyr Arg Ser 
1 5 

<210> 86 
<211> 5 
10<212> PRT 

<213> Artificial' Sequence 

<220> 

<223> The sequence of the FG loop of clone pLB25,7. 

15 

<400> 86 

Gly Trp Arg Trp Arg 
1 5 

20<210> 87 
<211> 5 
<212> PRT 

<213> Artificial Sec[uence 
25<220> 

<223> The sequence of the BC loop of clone pLB25.9. 
<400> 87 

Leu Arg Tyr Arg Ser 
30 1 5 

<210> 88 
<211> 5 
<212> PRT 
35<213> Artificial Sequence 

<220> 

<223> The sequence of the FG loop of clone pIjB25-9. 
40<400> 88 
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Gly Trp Arg Trp Arg 
1 5 . 

<210> 89 
5<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

10<223> The sequence of the BC loop of clone pLB25.ia 
<400> 89 

Gly Gin Arg Thr Phe 
1 5 

15 

<210> 90 
<211> 5 
<212> PRT 

<213> Artificial Sequence 

20 

<220> 

<223> The sequence of the FG loop of clone pIiB25.1 

<400> 90 
25Arg Arg Trp Trp Ala 
1 5. 

<210> 91 
<211> 5 
30<212> PRT 

<213> Artificial Sequence 

<220> 

<223> The sequence of the BC loop of clone pLB25. 

35 

<400> 91 

Leu Arg Tyr Arg Ser 
1 5 

40<210> 92 
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<212> PRT 

<213> Artificial Sequence 
5<220> 

<223> The sec[uence of the FG loop of clone pLB25.12. 



<400> 92 

Gly Trp Arg Trp Arg 
10 1 5 



<210> 93 

<211> 15 . 

<212> DNA 

15<213> Unknown 

<220> 

<223> The sequence of the BC loop of WT from Table 7. 

20<400> 93 

gcagttaccg tgcgt 

<210> 94 
<211> 5 • 
25<212> PRT 
<213> Unknown 

<220> 

<223> The sequence of the BC loop of WT from Table 7. 

30 

<400> 94 

Ala Val Thr Val Arg 
1 5 



35<210> 95 
<211> 24 
<212> DHA 
<213> Un3aiown 



40<220> 
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30 

of the FG loop of WT from Table 7 . 



<400> 95 

ggccgtggtg acagcccagc gage 

5 

<210> 96 
<211> 8 
<212> PRT 
<213> Unknown 

10 

<220> 

t-Vi^ FG loop of WT from Table 7, 
<223> The sequence of the J?u xoop 

<400> 96 

15Gly Arg Gly Asp Ser Pro Ala Ser 
1 5 



<210> 97 
<211> 15 
20<212> DNA 

<213> Artificial Sequence 



<220> 



r^f the BC loop of clone 1 from Table 
<223> The sequence of tne js^- J-^up 

25 7. 



<400> 97 

tcgaggttgc ggcgg 



30<210> 98 
<211> 5 
<212> PRT 

<213> Artificial Sequence 



35<220> 



<223> The sequence of the BC loop of clone 1 from Table 



7 



<400> 98 ' 
40 Ser Arg Leu Arg Arg 
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<210> 99 

<211> 15 

5<2i2> DNA 

<213> Artificial Sequence 

*<220> 

<223> The sequence of the FG loop of clone 1 from Table 

10 7. 



<400> 99 
ccgccgtgga.. gggtg 



15<210> 100 
<211> 5 
<212> PRT 

<213> Artificial Sequence 



20<220> 

<223> The sequence of the FG loop of clone 1 from Table 
7. 

<400> 100 
2 5 Pro Pro Trp Arg Val 

1-5 



<210> 101 

<211> 15 

30<212> DNA 

<213> Artificial Sequence 

<220> 

<223> The sequence of the BC loop of clone 2 from Table 

35 7- 



<400> 101 
ggtcagcgaa ctttt 



40<210> 102 
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<211> 5 
<212> PRT 

<213> Artificial Sequence 



5<220> 

^-F fhf- BC loop of clone 2 from Table 
<223> The sequence of tne xoop 



7 



<400> 102 
lOGly Gin Arg Thr Phe 
1 5 

<210> 103 
<211> 15 
15<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 
20 7 



<220> 

<223> The sequence of the FG loop of clone 2 from Table 



<400> 103 

aggcggtggt gggct 

25<210> 104 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
30<220> 

<223> The sequence of the FG loop of clone 2 from Table 
7. 

<400> 104 
35Arg Arg Trp Trp Ala 
1 5 



<210> 105 
<211> 15 
40<212> DNA 



wo 02/04523 



<213> Artificial Sequence 
<220> 

<223> The sequence of the BC loop 
5 7, 

<400> 105 
gcgaggtgga cgctt 

10<210> 106 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
15<220> 

<223> The sequence of the BC loop 
7. 

<400> 106 
20 Ala Arg Trp Thr Leu 
1 5 

<210> 107 
<211> 15 
25<212> DNA 

<213> Artificial Sequence 

<220> 

<223> The sequence of the FG loop 
30 7. 

<400> 107 
aggcggtggt ggtgg 

35<210> 108 

<211> 5 

<212> PRT 

<213> Artificial Sequence 
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of clone 3 from Table 



15 



of clone 3 from Table 



of clone 3 from Table 



40<220> 
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<223> The sequence 
7. 



34 

of the FG loop of clone 3 from Table 



<400> 108 
5Arg Arg Trp Trp Trp 
1 5 

<210> 109 
<211> 5 
10<212> PRT 

<213> Artificial Sequence 

<220> 

<223> A solubility tail- 

15 

<4O0> 109 

Gly Lys Lys Gly Lys 
1 5 

20<210> 110 
<211> 96 
<212> PRT 

<213> Artificial Sequence 
25<220> 

<223> The synthetic Fn3 gene. 



''''' . . ... Val pro Arg Asp I^eu Glu Val Val Ala Ala Thr 

Met Gin Val Ser Asp Vai Pro Arg i^^y 

5 10 

^ . T-i. q.v Tm ASP Ala Pro Ala Val Thr Val Arg 

Pro Thr Ser Leu Leu lie Ser Trp Asp a^ci 

20 



^ «g lie Thr Tyr 31y Gl« Thr Gly Gly s« Pro V.1 Gin 
3SG1„ Ph. L V.1 Pro «ly a=r ser Thr Ala Thr He ser Gly Leu 
.y. 1 Oly V.1 ASP xyr L lie Thr val Tyr .la Val Thr Gly ^3 



Gly Aap Ser Pro Ala Ser Ser Lys Pro He ser He Ash Tyr Arg Thr 
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<210> 111 
<211> 308 
<212> DNA 

<213> Artificial Sequence 

5 

<220> 

<223> The designed Fn3 gene. 



<400> 111 






lOcatatgcagg 


tttctgatgt tccgcgtgac ctggaagttg ttgctgcgac cccgactagc 


60 


ctgctgatca 


gctgggatgc tcctgcagtt accgtgcgtt attaccgtat cacgtacggt 


120 


gaaaccggtg 


gtaactcccc ggttcaggaa ttcactgtac ctggttccaa gtctactgct 


180 


accatcagcg 


gcctgaaacc gggtgtcgac tataccatca ctgtatacgc tgttactggc 


240 


cgtggtgaca 


gcccagcgag ctccaagcca atctcgatta actaccgtac ctagtaactc 


300 


ISgaggatcc 




308 



<210> 112 

<211> 96 

<212> PRT 

20<213> Artificial Sequence 

<220> 

<223> The designed Fn3 gene. 



25<400> 112 

Met Gin Val Ser 
1 

Pro Thr Ser Leu 
20 

30Tyr Tyr Arg lie 
35 

Glu Phe Thr Val 
50 

Lys Pro Gly Val 
3565 

Gly Asp Ser Pro 



Asp Val Pro Arg 
5 

Leu lie Ser Trp 

Thr Tyr Gly Glu 
40 

Pro Gly Ser Lys 
55 

Asp Tyr Thr lie 
70 

Ala Ser Ser Lys 
85 



Asp Leu Glu Val 
10 

Asp Ala Pro Ala 
25 

Thr Gly Gly Asn 

Ser Thr Ala Thr 
60 

Thr Val Tyr Ala 
75 

Pro lie Ser lie 
90 



Val Ala Ala Thr 
15 

Val Thr Val Arg 
30 

Ser Pro Val Gin 
45 

lie Ser Gly Leu 

Val Thr Gly Arg 
80 

Asn Tyr Arg Thr 
95 



<210> 113 

40 
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<400> 113 
000 

<210> 114 
5<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

10<223> A fusion protein. 
<400> 114 

Met Gly ser Ser His His His His His His Ser Ser Gly Leu Val Pro 
1 5 
ISArg Gly Ser His 
20 

<210> 115 
<211> 10 
20<212> PRT 

<213> Artificial Sequence 

<220> 

<223> A sequence from clone Plb25.1. 

25 

<400> 115 

Gly Gin Arg Thr Phe Arg Arg Trp Trp Ala 
1 



10 . 15 



5 10 



30<210> 116 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
35<220> 

<223> A sequence from clone Plb25.4. 
<400> 116 

Leu Arg Tyr Arg Ser Gly Trp Arg Trp Arg 
40 1 ^ 
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<210> 117 
<211> 12 
<212> PRT 

<213>*- Artificial Sequence 

5 

<220> 

<223> A sequence from clone pLB24.1. 
<400> 117 

lOCys Asn Trp Arg Arg Arg Ala Tyr Arg Tyr Trp Arg 
15 10 

<210> 118 
<211> 12 
15<212> PRT 

<213> Artificial Sequence 

<220> 

<223> A sequence from clone pLB24.3. 

20 

<4P0> 118 

Ala Arg Met Arg Glu Arg Trp Leu Arg Gly Arg Tyr 
15 10 

25<210> 119 
<211> 4 
<212> PRT 
<213> Homo sapiens 

30<400> 119 

Glu lie Asp Lys 
1 

<210> 120 
35<211> 4 
<212> PRT 
<213> Unknown 

<220> 

40<223> Anti-hen egg lysozyme (HEL) antibody. 
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<400> 120 
Arg Asp Tyr Arg 
1 

5<210> 121 
<2X1> 96 
<212> PRT 

<213> Homo sapiens 



. v.l pro Axg ASP Leu Glu Val Val Ala Ala Thr 
Met Gin Val Ser Asp Val Pro Arg Asp 

10 



^ T qer TnD Asp Ala Pro Ala Val Thr Val Arg 

Pro Thr Ser Leu Leu He Ser Trp Asp «^ 

20 ^ _ 

l=Tyr Tyr Arg He Thr Tyr Gly Glu Thr Oly Gly S» ^al Gl. 

40 

01. Ph= 1 val Pro Gly s.r Lys s.r Thr Thr 11= S» Gly I,au 
.y. pro Gly val ^ Tyr Thr 11. Thr Val Tyr «a val Thr Gly 

75 

"gI sar pro «a Ter S„ .y» Pro He =ar lie Tyr «3 Thr 

90 ^ 

85 
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